things  I  have  written  about 
elsewhere  #201 80202 

Privatezen 


Privatezen 


This  was  originally  published  on  the  Who's  On  First 

weblog  https://whosonfirst.org/biog/20i8/02/02/privatezen/  ,  in 
February  2018. 


Way  back  in  2012  I  started  working  on  a  project  called 
Privatesquare  https://straup.github.io/privatesquare/  . 

At  the  time  I  was  doing  a  lot  of  work  around  the  idea  of  personal 
archiving  https : / /WWW. aaronland . info/weblog/ 2012/02/14/in 
centivize/#pda2  0i2  and  Privatesquare  was  originally  conceived 
as  a  simple  backup  of  my  Foursquare  check-ins. 

At  one  point  I  started  thinking:  I  don't  care  if  Foursquare  knows  I 
went  to  the  bar  around  the  corner,  but  maybe  I'm  not  so  keen  on 
them  knowing  I  went  to  the  drug  store.  At  the  same  time  maybe  I  am 
interested  in  having  a  record  of  that.  So  what  began  as  a  simple 
unidirectional  archiving  project  morphed  in  to  a  tool  where  I  would 
check-in  to  Privatesquare  first  and  then  optionally  tell  Foursquare. 

Fun  fact:  Without  Privatesquare  the  IDs  in  Who's  On  First  might 
have  been  completely  different  but  that  is  an  entirely  other 
Story  https : //www. aaronland . info/weblog/2012/12/ 01 /coffee 
-and-wif  i/#timepixeis  ,for  another  day. 


000000^000000 000 00 000 00 00000 00 00*0000 00 00 00 

00000*00 

000*00*00000000*00*00 

function  vonucs_statoofmind_vcnucs ( ) { 

$ venues  ■  array ( 

array ( 'id' 

■> 

'51783653' , 

'name ' 

■> 

'in  a  car' ) , 

array! ' id ' 

-> 

'51783655' , 

'name ' 

■> 

' in  a  cab' ) , 

array ( ' id ' 

■> 

'52204087' , 

'name ' 

■> 

' on  a  bike ' ) , 

array!  id' 

■> 

'51783657' , 

'name ' 

■> 

'on  a  plane' ) , 

array! ' id ' 

■> 

'51783659' , 

'name ' 

■> 

'on  a  train' ) , 

array ( ’ id ' 

■> 

'51783661' , 

'name ' 

■> 

' on  a  boat ' ) , 

array! ' id ' 

■> 

'52204085' , 

'name ' 

-> 

’ on  the  water ' ) , 

array! ' id ' 

■> 

'51866911' , 

'name ' 

«> 

'in  transit' ) , 

array!  id' 

■> 

'68234601' , 

'name ' 

■> 

' in  a  meeting ' ) , 

array! ' id ' 

■> 

'68234603' , 

'name ' 

■> 

'on  a  telco’ ) , 

array ( ’ id ' 

■> 

'52204033' , 

'name ’ 

-> 

'wish  you  were  here'). 

array! 'id' 

-> 

'52204035' , 

'name ' 

■> 

'kill  me  now' ) , 

array! ' id ' 

-> 

'52204045'  , 

'name ' 

■> 

' EXTERMINATE  1111'), 

array ( ' id ' 

>; 

■> 

'51866909' , 

'name ' 

■> 

' so  confused' ) , 

return  $ venues; 

) 

00 000 00 00 000 000 00 000 00 0000\ 

*00000000000000 

00*00000 

f 0000000000000000 

Privatesquare  eventually  grew  more  sophisicated  allowing  you  to 
"check-in"  to  places  not  defined  in  Foursquare.  For  example,  old 
buildings  in  New  York  City  courtesy  the  New  York  Public 
Library  https : / / git hub . com/NYPL/ gazetteer/wiki/ApiDocs 
and  so-called  "states  of  mind".  It  also  added  the  ability  to  record 
travel  and  to  record  things  you  liked  or  didn't  like  in  the  Atlas  of 
Desire  http: / /nearfuturelaboratory. com/2012/ 09/ 04/the- 


at las -of -desire/ 


Privatesquare  was  always  a  mornings-and- weekends  project  and 
used  a  variety  of  external  services  to  get  thing  done: 

Foursquare  https://developer.foursquare.com/  for  venues 
and  search;  Stamen  http :  / / maps .  stamen .  com/  for  map  tiles; 
The  Flickr  API  https://www.flickr.com/services/api  for 
geocoding  cities. 

The  first  week  I  started  at 

Mapzen  https://en.wikipedia.org/wiki/Mapzen  ,  in  2015, 1 
remembering  thinking:  I  wonder  if  I  can  swap  out  each  one  of  third- 
party  services  used  by  Privatesquare  with  an  equivalent  Mapzen 
service?  The  answer,  at  the  time,  was  "No". 

It  was  a  useful  reminder  of  the  work  we  had  set  out  for 
Ourselves  https : / / archive . org/ search . php? 
query=mapzen%2 Oweblog  . 


Fast  forward  to  the  spring  of  2017  and  I  had  begun  to  experiment 
with  small  bespoke  Electron-based 

applications  https :  /  /electron js  .org/  ,  specifically  a  tool  that 
a  person  could  use  to  explore  and  test  the  Who's  On  First 
API  /blog/2017/04/04/whosonf  irst-api/  .Electron 
applications  are  written  in  HTML  and  CSS  and  Javascript  and  then 
bundled  up  and  compiled  in  to  a  binary  application. 


I  continue  to  be  interested  in  Electron  apps  because  they  afford  the 
ability  to  more  easily  craft  something  that  looks  and  feels  like  a 
native  application,  across  different  platforms,  without  a  lot  of  the 
boring  grunt-work  that  comes  with  writing  native  applications. 

After  I  finished  the  Who's  On  First  API 

Explorer  /blog/ 2017/04/ 2  8/whosonf irst-api-explorer/ 
application  I  remember  thinking  back  to  2015  and  wondering  if  I 
could  finally  swap  out  all  of  the  third-party  services  in  Privatesqure 
with  an  equivalent  Mapzen  service. 


The  answer  this  time  was  "Yeah,  probably..." 


Instead  of  updating  the  Privatesquare  web 

application  https://github.com/straup/privatesquare/  I 
decided  to  build  a  dedicated  Electron  application  from  scratch.  I  did 
this  for  a  few  reasons: 

•  I  didn't  want  the  extra  hassle  of  running  another 
server  with  user  accounts. 

•  I  also  didn't  want  to  store  any  user  data  on  a  remote 


server. 


•  I  wanted  to  see  whether  the  model  that  Firefox  team 
had  developed  for  their  Sync 

product  https : / /blog. mozilla.org/warner/2014/ 0 
5/23/the-new-sync-protocol/  could  be  applied  to 
check-ins  and  personal  location  tracking. 

•  I  wanted  to  think  about  building  an  application  that 
could  be  developed  and  deployed  across  a  number  of 
platforms  and  devices ,  each  of  which  might  have  a 
purpose-built  UI  but  all  of  which  shared  the  same 
underlying  data  model,  in  this  case  a  simple  SQLite 
database. 

Everything  in  Privatesquare  is  private  to  an  individual.  There  is  no 
way  to  "share"  a  place  or  a  list  or  really  anything  you  do  in  the 
application.  That  is  by  design  but  even  though  you  could  run  your 
own  copy  of  Privatesquare  (and  a  few  people  did)  if  you  used  my 
instance  then  your  data  was  stored  alonside  everyone's  else  data  in  a 
shared  database  that  someone  else  (me)  controlled.  So  it  is  "private" 
in  as  much  as  you  trust  me. 

That's  a  truism  bordering  on  a  platitude  but  in  2018  trust  in  systems, 

if  not 

individuals  https : //WWW. aaronland . info/weblog/2012/10/ 08/s 
igns/#stories  ,  increasingly  seems  like  it  is  in  short  supply.  Every 
day  bring  news  of  some  previously  unknown  existential  flaw  in  the 


fundamental  principles  of 

Computers  https  :  / /WWW.  raspberrypi .  org/blog/why- 
r aspber ry-pi- isnt- vulner able- to- spec tre-or-meltdown/  and 
computer  networks  -  aka  "the  cloud"  -  so  that  even  if  you  implicitly 
trust  an  operator  or  a  service  the  guarantee  of  that  trust  feels  like  it  is 
called  in  to  question. 


So  what  does  it  mean  to  not  store  sensitive  data  (whether  it  is 
sensitive  or  just  feels  sensitive  amounts  to  the  same  thing)  on  a 
remote  server?  The  problem  of  course  is  that  people  now  have 
multiple  computers,  big  and  small,  and  have  come  to  expect  that  all 
those  "devices"  can  and  will  can  in  sync  with  one  another. 

One  approach  to  mitigating  this  problem  is  simply  encrypting  the 
data  in  transit,  and  at  rest,  between  devices.  This  is  how  Firefox 
synchronizes  things  https://github.com/mozilla/fxa-auth- 
server/wiki/onepw-protocoi  like  browser  preferences  and 
bookmarks  between  a  user's  desktop  computer  and  their  mobile 
phone. 


Ultimately  there  is  still  a  server  brokering  the  communication 
between  devices  and  there  is  a  lot  of  math  and  cryptography  and 
there  are  still  user  accounts  and  a  bunch  of  ways  you  can  shoot 
yourself  in  the  foot  and  never  be  able  to  decrypt  your  data  (that's 
what's  going  on  in  the  scary  screenshot  above)  but  the  thing  you  get 
in  return  for  all  that  hoop-jumping  is  that  Mozilla,  the  parent 
company  of  Firefox,  doesn't  know  what  you're  bookmarking. 


I  don't  belive  what  Firefox  is  doing  is  a  perfect  solution,  by  any 
means,  but  it  feels  like  forward  motion  and  is  at  least  "better  than 
yesterday".  In  as  much  as  "check-ins"  are  just  "bookmarks"  by 
another  name,  maybe  there's  something  to  learn  from  people  already 
working  on  this  problem. 


It's  also  worth  a  quick  tangent  to  mention  differential 

privacy  https : / /en.wikipedia.orq/wiki/Dif ferential  priv 

acy  and  Google's  federated 

learning  https : / /research. googleblog.com/2017/ 04/federate 
d-learning-collaborative . html  here. 


When  user  data  is  encrypted  at  the  service  provider  level  it  is 
problematic  for  companies  that  rely  on  aggregating  that  data  (and 
hence  be  unencrypted)  in  to  something  they  can  resell,  whether  its 
advertising  or  genuinely  useful  secondary  services.  Nevermind  the 
former  but  I  choose  to  believe  there's  an  argument  to  be  made  in 
favour  of  the  latter. 

What  I  think  is  valuable  about  projects  like  differential  privacy  and 
federated  learning  is  the  idea  there  might  be  a  way  to  say  to  people: 
We  care  intensively  about  whatever  it  is  you're  "doing"  but  we  don't 
actually  care  about  you,  personally  or  individually.  This  is  interesting 
to  me  for  projects  like  Who's  On  First  and  Privatesquare/zen  when  it 
comes  to  things  like  validating  data  but  more  generally  I  think  there 
is  a  need  and  real  value  for  people  to  have  a  way  to  participate  in 
collaborative  efforts  with  some  measure  of  ambiguity,  if  not 
anonimity. 


Simson  Garfinkel's  presentation  on  Modernizing  the  Disclosure 
Avoidance  System  for  the  2020 

CenSUS  https ://youtu.be/G3voy0MuoN0?t=lh54m2  0  for  the 


Census  Scientific  Advisory  Committee  (CSAC),  last  year,  is  a  good 
discussion  of  some  of  the  benefits  and  trade-offs  in  this  approach. 


Lots  of  people  have  written  about  and  demonstrated  the  value  of 
SQLite  https :  // sqiite.org/  databases,  none  more  eloquantly 
than  Paul 

Ford  https : / / newrepublic . com/ article/ 12 4425 /dreamed- 
perf  ect-database  .  Similarly  Simon's  Willison's 
Datasette  https : / / simonwiiiison.net/2017/Nov/13/datasett 
e/  project  is  a  good  example  of  how  SQLite  databases  can  make 
"simple  things  and  hard  things  possible". 

Even  before  that,  though,  SQLite  has  been  used  under  the  hood  for 
years  by  a  wide  array  of  applications  and  services.  SQLite  databases 
and  small  and  portable  and  self-contained  (a  single  file  on  disk  that 
you  could  send  to  someone  as  an  email  attachment)  and  while  they 
still  require  a  user  to  install  dedicated  software  to  work  with  the  data 
those  tools  are  readily  available  in  2018. 

And  lots  of  different  tools  can  read  and  write  SQLite  databases. 
Aside  from  the  mechanical  considerations  for  using  SQLite  (only 
one  file  to  encrypt  and  decrypt  and  send  across  the  wire)  I  like  the 
fact  that  the  raw  data  that  makes  the  application  unique  and  useful  to 
an  individual  is  something  which  is  also  accessible  to  browse, 


manipulate  or  do  whatever  someone  wants  with  outside  of  that 
application. 


It  provides  a  useful  check  and  a  separation  of  concerns  that  forces  an 
application  to  succeed  on  its  own  merits  and  not  because  it  has  the 
key  to  all  your  data. 


One  day,  after  showing  some  work-in-progress  of  my  new  Mapzen- 
enabled  Privatesquare  -  aka  "Privatezen"  -  at  the  office  a  colleague 
asked  if  this  was  an  official  Mapzen  project.  It  was  not  and  never 
was.  I  would  have  enjoyed  making  it  one  but  we  already  had  million 
other  things  on  the  go  so  it  was  a  thing  that  I  would  poke  at  in  45- 
minute  increments  over  coffee  in  the  morning  before  going  to  do  the 
"real  work"  of  Who's  On  First. 


It  was  and  has  remained,  for  now,  a  tool  uniquely  tailored  to  me 
because  I  assume  I  am  the  only  one  who  can  see  past  all  the  short¬ 
cuts  and  gotchas  to  bother  using  the  thing. 

It  (bookmarking  places  or  "check-ins"  or  whatever  you  want  to  call 
it)  is  a  thing  I've  always  missed  since  I  stopped  running  a  hosted 
version  of  Privatesquare.  At  the  same  time,  I've  always  felt  that  there 
was  no  point  in  doing  Who's  On  First  unless  you  could  actually  do 
something  with  the  data  so  this  project  was  a  good  way  to  kick  the 
tires. 


"Privatezen",  the  application,  is  organized  around  six  categories: 
Visits,  Neighbourhoods,  Cities,  Lists,  #feelings  and  Trips.  All  of 
these  categories  hold  hands  with  each  other  meaning  you  should  be 
able  to  look  for  all  the  visits  in  a  particular  neighbourhood  or  all  of 
the  lists  that  have  visits  in  a  particular  city  or  #feelings  associated 
with  trips  and  so  on. 

" #feelings "  used  to  be  " desires "  (as  in  Private  square's  Atlas  of 
Desire  http: / /nearf uturelaboratory.com/2012/ 09/ 04/the- 
at las -of -desire/  )  but  a  hashtag  felt  more  in  keeping  with  the 
times... 

Here's  what  a  "visit"  looks  like: 


9  9  Who's  On  First  Bookmarks 


CITIES  HOODS  VISITS 


Hut's  Hamburgers  a 

Market  District,  Austin,  Texas,  United  States 
1 939  to  the  present 

visits  T 

You  said  "i've  been  there"  on  or  around  201 7-06-23T1 9:09:1 5.91  OZ  ® 


details  * 


wof:id 

638608585 

wof:parent_id 

85893535 

wof:name 

Hut's  Hamburgers 

wofrplacetype 

venue 

wof:co  untry 

US 

wof-.repo 

whosonfirst-data-venue-us-tx 

edtfiinception 

1939 

Chief  among  the  long  list  of  short-cuts  and  gotchas  is  the  fact  that 
search  has  never  worked  very  well,  for  three  reasons. 


First  lots  of  venues  (in  Who's  On  First)  hadn't  been  indexed  in 
Mapzen  Search  yet.  Second,  we  were  using  the 

mapzen.js  https://github.com/nextzen/mapzen.js  library  for 

map  display  and  the  Pelias 

API  https  : / / git hub . com/ pel ias /pelias - 

doc/blob/master/ search. md  for  search.  Although  the  Who's  On 


First  API  implemented  a  Pelias-API  compliant 

endpoint  https : / /github.com/whosonf irst/whosonf irst-www- 
api/blob/master/www/ include/lib  api  whosonfirst  pelias . p 

hp  it  was  always  buggy  and  those  fixes  just  never  made  it  to  the  top 
of  the  stack.  Third,  I  just  never  got  around  to  implementing  a  good 
generic  list  view  interface  for  things  like  search  results. 


Instead,  because  I  was  the  only  person  ever  using  this  tool,  I  suffered 
the  hassle  of  looking  up  a  venue  ID  (usually  from  the 
Spelunker  /speiunker/  )  and  then,  using  the  special  syntax  id : 
{WHOSONFIRST_ID} ,  would  paste  it  to  the  search  box  on  the  top 
right. 


From  there  the  application  would  look  up  the  venue  details  and 
display  them  with  a  handy  map  at  which  point  you  could  save  your 
"visit'1  by  saying  nothing  more  specific  than  "I’ve  been  there"  and  the 
UI  would  update  itself  accordingly. 


There  are  couple  things  to  note:  There  was  nothing  venue  specific 
about  this  interface.  You  could  have  saved  any  place  in  Who's  On 
First.  There  was  also  nothing  specific  about  signaling  when  a  visit 
took  place. 


const  feelings  =  { 

0:  "i've  been  there", 

1:  "i  was  there", 

2:  "i  want  to  go  there", 
3:  "again  again", 

4:  "again". 


}; 


5:  "again  maybe", 

6:  "again  never", 

7 :  "meh" , 

8:  "i  would  try  this". 


Although  there  is  a  timestamp  indicating  when  a  visit  was  added  to 
the  database  the  application  doesn't  assume  that  you  are  recording 
things  in  the  moment.  In  fact  you  can  use  the  application  to  signal 
places  that  you  want  to  visit. 


SAVE 


New  Museum  a 

235  Bowery,  New  York,  NY  10002, 

Bowery,  New  York,  New  York,  United  States 

the  sometime-past  to  the  present 


visits  ♦ 
lists  i 

nearby  transit  T 

•  bowery 

j  z 

•  bowery 

m  j  z 

•  bowery/prince  st 

ml  03 

•  bowery/kenmare  st 

ml  03 

•  bowery/delancey  st 

ml  03 


I  was  also  trying  to  integrate  as  many  of  the  various  Mapzen 
services,  not  so  much  for  the  sake  of  it  but  because  they  were  all 
useful  and  to  understand  how  they  fit  together  and  where  the  stress 
points  remained. 

Once  a  Who's  On  First  place  was  loaded  on  to  the  map  we  would 
call  the  Transitland  API  https :  /  /transit .  land/  for  nearby 
transit  stops.  Like  a  lot  of  features  this  never  moved  beyond  the  basic 
prove-it-works-with-a- list- view  stage  on  to  a  more  refined  and  useful 
user  interface. 

Routing,  and  something  like  Dan's  All  of  the 
Places  /blog/2016/ 0  8/24/ aii-of-the-piaces/  experiments 
from  2016,  were  also  on  the  list  but  there  was  never  enough  time  for 
all  of  the  things... 


Who's  On  First  Bookmarks 


VISITS 

Search  for  a  place  in  W 

SEARCH 

Edmonton  o 
Jasper  Notional 


9 

Vancouver 

* 

San  Francisco 


NORTH 

AMERICA 


Los  Angeles  °  Phoenix 


CALiFOinrrA 

€  Mjczlt.i.  Oaji'S'.-iotV^.  ano  u:l-tvs  |  L^il:: 


ill  {'lerror":{''code":432,"message":"Missing  'id'  parameter"}, "stat":"error"} 


Tojo's  Restaurant,  in  Vancouver 

You  said  "again  again"  on  or  around  20 1 7-06-24T01 :04:40. 1 74Z  0 

Jae  Bu  Do,  in  Los  Angeles 

You  said  "i  want  to  go  there"  on  or  around  2017-O6-24TO0:57:45.970Z  0 

Lardo,  in  Portland 

You  said  "again"  on  or  around  2017-06-24T00:55:30.191Z  0 

Robin,  in  San  Francisco 

You  said  "i  want  to  go  there"  on  or  around  2017-06-23T19:57:26.558Z  0 

Hotel  Delmano,  in  New  York 

You  said  "again  again"  on  or  around  2017-06-23T19:38:10.928Z  0 

El  Papi  Street  Tacos,  in  Laurel 

You  said  "i  want  to  go  there"  on  or  around  2017-06-23T19:37:41.574Z  0 

Angel's  Share,  in  New  York 

You  said  "again  again"  on  or  around  201 7-06-23T1 9:34:40.1 83Z  0 

Mimi  Cheng's,  in  New  York 


The  default  view  when  you  launch  the  application  is  to  show  your 
most  recent  visits  in  reverse  chronological  order  with  a  handy  map. 
Generally  the  rule  of  thumb  was  to  add  "a  handy  map"  whenever 
possible. 


The  other  rule  was  to  make  every  "first  class  object"  in  every  view 
clickable.  Here's  what  I  would  see  when  I  clicked  New  York 
City  https://places.whosonfirst.org/id/  85977539/ 

As  you  can  see  from  the  many  dots  on  the  map  and  only  three  visible 
"visits" ,  on  the  right,  another  of  the  many  neglected  UI  details 
included  how  best  to  fit  everything  in  to  a  finite  amount  of  space  and 
signaling  when  a  given  element  overflowed  the  viewport. 


Who's  On  First  Bookmarks 


Canada 


San  Francisco 

you've  mentioned  stuff  here  21  times  spanning  15  neighbourhoods 
you  said  "again"  8  times  and  "i  want  to  go  there"  7  times  and 
"i've  been  there"  5  times  and  "meh”  once 

New  York 

you've  mentioned  stuff  here  10  times  spanning  9  neighbourhoods 
you  said  "again  again"  5  times  and  "again"  4  times  and 

"i  want  to  go  there"  once 

Montreal 

you've  mentioned  stuff  here  10  times  spanning  6  neighbourhoods 
you  said  "i  want  to  go  there"  4  times  and  "again"  3  times  and 

"i've  been  there"  twice  and  "again  again"  once 

Los  Angeles 

you've  mentioned  stuff  here  2  times  in  one  neighbourhood 
you  said  "i  want  to  go  there"  once  and  "again"  once 

Berlin 

you've  mentioned  stuff  here  2  times  In  one  neighbourhood 


Q  Mapzen,  OpenStreetMap.  and  others  \  Leaflet 


you  said  "again  again"  once  and  "again"  once 


ti  {"error":{"code":432,''message":''Missing  'id'  parameter”}, ''stat":"error") 


Here  are  the  top  cities  that  I  had  recorded  visits  for.  These  were 
grouped  again  by  #feelings  about  the  places  that  I  had  visited  in  that 
city.  There  are  likewise  views  for  neighbourhoods  which  was  often 
very  useful  when  trying  to  figure  out  where  to  meet  someone  for 
dinner  and  then  immediately  drawing  a  complete  blank  when  asked 
to  pick  a  spot. 


again  again 

in  Los  Angeles 
desires  ? 

Getty  Center,  in  Los  Angeles 

You  said  "again  again"  on  or  around  2017-06-26T23:03:13.530Z  <s> 

In-N-Out  Burger  ,  in  Los  Angeles 

You  said  "again  again"  on  or  around  2017-06-29T21:05:22.599Z  <S> 


Credit  for  the  basic  model  of  grouping  things  in  to  buckets  labeled 
"again",  "again  again",  "again  maybe"  and  "again  never"  goes  to 
Chris  Heathcote  https://mobile.twitter.com/antimega  . 
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NC^York  City 


jVlushW'g*'*- 
Nl yrtle  *ve 


BEDFORD-STUWESANT 


Atlantic  Avr- 


•  Eastern- 
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Hoboken 


Drunk  in  NYC 

•  Weather  Up  s 

.  B61  s 

•  Gowanus  Yacht  Club  ® 

•  Marie's  Crisis  Cafe  s 

•  Hotel  Delmano  s 

•  Angel's  Share  s 

•  Big  Bar  <s> 

•  Little  Branch  s 

•  Dutch  Kills  s 

•  Clover  Club  is 


It  is  possible  to  create  lists  and  like  the  transit  work,  above,  they 
never  matured  much  beyond  the  "prove  it's  possible  with  a  list  of 
items"  stage. 


Who's  On  First  Bookmarks 


NEIGHBOURHOODS 


cQop 


Montreal  © 

201 7-1 0-09  to  201 7-1 0-1 3  ® 

NAGS  2017 

confirmed 

New  York  City  © 

2017-09-11  to  201 7-09-1 5  ® 

Mapzen  East 
tentative 

Montreal  © 

201 7-06-30  to  201 7-08-06  ® 

confirmed 

New  York  City  © 

201 7-05-25  to  201 7-05-29  ® 

Micah  and  Amanda's  wedding 
confirmed 

Valencia  © 

201 7-05-22  to  201 7-05-22  ® 
All  Calatrava,  all  the  time 


You  can  add  trips  to  places  which  is  useful  when  combined  with  the 
"I  want  to  go  there"  #feeling. 

This  is  a  straight  clone  of  the  trips  functionality  in 
Privatesquare  https  :  / /ww.  aaronland.  inf  o/weblog/2  014/04/2 
l /mirrors /#trips  which  cloned  the  idea  from  the  old  social- 
travel  website  called 

Dopplr  https://en.wikipedia.org/wiki/Dopplr 
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destination 

Search 

can  the  search  button  do  autocomplete  in  this  element? 

arrival 

YYYY-MM-DD 

ASIA  NORTH  n  ion  of  ASM  NORTH 

AMERICA  tUKURt  AMERICA 

departure 

ATLANTIC  OCEAN 

' 

YYYY-MM-DD 

status 

unknown 

ANTARCTICA 

notes 

+ 
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I  never  did  figure  out  how  to  assign  the  mapzen .  j  s  search  and 
geocoding  widget  to  another  element  outside  of  the  map... 


Who's  On  First  Bookmarks 


TRIPS  CITIES  NEIGHBOURHOODS  VISITS  LISTS 


#FEELINGS 


BROWSE 


@  ☆  : 


Q. 


Before  going  to 

NACIS  http  : / /localhost: 8080/blog/2017/10/17/whosonf irst 
-nacis-2  o  17/  last  year  I  spent  a  little  bit  of  time  writing  the  code 
to  generate  a  PDF  file  of  all  the  places  associated  with  a  trip,  with  an 
eye  towards  extending  the  functionality  to  any  list  view. 


This  work  got  stuck  between  the  part  where  we  use  the  Leaflet 
GeoJSON  plugin  http: / /leaf let js . com/ref erence- 
1.3.0. htmi#geo  json  to  draw  features  on  the  map  and  the  built-in 
ability  of  tangram.js  https  :  /  / github .  com/ tangrams  / tangram 
to  screenshot  an  image  of  the  current  map  to  include  in  the  PDF  file. 

The  problem  being  that  because  the  features  displayed  on  the  map 
are  being  drawn  by  a  completely  different  map  "layer"  managed  by 


Leaflet  http://ieafietjs.com/  that  the  tangr am .  j s  code 
doesn't  even  know  about,  from  tangr  am .  j  s 's  perspective  there 
are  no  features  to  draw  in  screenshot-mode. 


#  •  Who's  On  First  Bookmarks 
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•  Brooklyn  Heights  Lifestyle 

•  Restaurant  Jano 

•  Bookseller  SW  Welch 

•  L  Berson  &  Son 

•  Warshaw  Supermarket  Inc 

•  Main  St  Lawrence  Steak  House 

•  Moishe's  Steak  House 

•  Rush  Couture 

•  Montreal  Piscines  Interieures 

•  Restaurant  Casa  Maison  Minhota 

•  Compagnie  Marie  Chouinard 

•  Akoha 

•  Xiphos  Technologies  Inc 

•  Bar  Blizzarts 

•  Marais,  Marie 

•  Friperie  St-Laurent 

•  Toulch,  Michael  OD 

•  Segal's  Market 

•  Cinema  L'Amour-Xxx 

•  Productions  Loma 

•  Bar-Restaurant  Les  Champs 

•  Restaurant  Le  Vieux  St  Laurent 

•  Triangle  Portugais 

•  Steko  Films  Inc 

Search  for  a  place  in  W  SEARCH 


I  mentioned  earlier  that  there's  never  been  a  proper  interface  for  list 
views.  Browsing  all  the  nearby  venues  surrounding  a  point  is  a  good 
example  of  that. 
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<2> 


edit  4 


Whiskey  Cafe  Enr  974804941 

5800  Boul  Saint-Laurent  Montreal  QC  H2T  1T3 
45.527412  -73.602409 
lounge  cocktail 

edit  t 


✓  this  place  is 


open 

closed 

unknown  '  SAVE 


Le  Butterblume  1158786349 

5836  boul.  St-Laurent,  Montreal 
45.527607  -73.603034 
restaurant  brunch 

edit  i 


believe  this  place  is  "current" 


45.527094  -73.602530  17 


id:1 15878931 3  SEARCH 


Here's  a  screenshot  of  some  work  that  I  started  to  to  do  to  develop  a 
generic  view  (or  "card")  for  any  given  place.  This  included 
developing  a  range  of  colour-codings  to  apply  to  things  like  dots  on 
the  map,  to  signal  properties  about  a  venue  like  whether  it  was  open 
or  closed. 

I  always  imagined  that  these  views,  or  cards,  could  do  double-duty 
as  a  place  where  people  could  help 

improve  /biog/20i6/04/08/yesnof  ix/  the  underlying  Who' s 
On  First  data. 
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O 


Suriya  Thai  Restaurant  571507099 

sometime  in  the  past  to  2009-05-20 
1432  Valencia  St  San  Francisco  CA  941 10 
37.749913  -122.420627 

edit  i 


Pi  Bar  957732581 
2009-09 

1432  Valencia  Street,  San  Francisco,  CA,  USA 
37.749908  -122.420748 
pizza  beer  IPA 

edit  i 


/e  believe  this  place  is  "current" 


Tom  Ammiano  571713753 
1 41 7  Valencia  St  San  Francisco  CA  941 1 0 
37.749173  -122.420299 

edit  i 


Here's  a  tangible  illustration  of  the  the  Who's  On  First 
supersedes  and  superseded_by  properties  in  action.  These 
two  restaurants  "occupy"  the  same  building:  First  Suriya 
Thai  https://places.whosonfirst.org/id/5  71507  099  until 
early- 2009  and  then  Pi 

Bar  https : / / places . whosonf irst . org/ id/ 957732581 
following  that. 


--NOE- 

.VALLEY- 


(Bernal  Hills  Realty  ffi 

{Real  Eslate/Real  Estate  Agent/Services)' 


1  BERNAL] 
HEIGHTS] 


GLEN  PARK  . 


Holly  Park 


_ 


St  whosonfi rst.mapzen.com/classif  iers/#lat=37.7428&lng=-122.4216&z=1 4. 


WOF  Classifiers 

press  keys  to  isolate/exclude/reset 
press  1  -9  for  search  radius  (500m) 
press  L  to  show  every  label 
press  U  to  raise/lower  labels 
press  A  to  show  all 

O  retail  (press  r) 
doctor  (press  d) 

^  professional  (press  p) 

O  services  (press  s) 

Q  bar  (press  b) 

Q  food  (press  f) 

O  construction  (press  c) 

Q  transportation  (press  t) 

Q  others  (press  o) 


Speaking  of  colour  coding  dots  on  a  map  here's  a  screenshot  of  some 
lovely  work  that  John  Oram  https :  / /burrito  justice . com/ 
did  to  display  and  filter  venues  by  category. 


6  whosonfi rst.mapzen.com/classif  iers/#lat=37. 7411 0&lng=-122.42483&z= 


iValley-Stj 


;29llV5t= - 


II  1 

JL^OaySt. — -  ‘ 


(Retail/Laundry  &  Dry  Clea 


Fairmount  Elementary  School 


(Building  I 


O  retail  (press  r) 

^  doctor  (press  d) 

^  professional  (press  p) 
O  services  (press  s) 


This  isn't  work  that's  part  of  "Privatezen"  today  but  there  a  bunch  of 
useful  tricks  that  could  be  applied  from  John's  work. 

If  nothing  else  John  is  drawing  all  those  dots  in  tangram.  j  s  rather 
than  using  the  Leaflet  GeoJSON  plugin  which  means  if  we  did  the 
same  in  "Privatezen"  then  we  wouldn't  have  the  dotless- screenshots- 
for-use-in-PDF-files  problems  described  above. 


We'll  badger  John  in  to  writing  a  blog  post  about  his  work  soon. 


The  application  does  a  few  other  things  that  aren't  as  easy  to 
screenshot,  or  even  if  they  were  make  for  really  boring  screenshots. 
For  example,  "Privatezen"  is  designed  to  work  offline. 


It  works  offline  in  the  sense  that  because  the  SQLite  database  is 
sitting  on  your  computer  you  don't  need  to  fetch  everything  from  a 
remote  server  so  pretty  much  everything  except  search  and  maps 
(discussed  below)  will  work  as  expected,  regardless  of  your  internet 
connection. 


It  sort  of  works  offline  in  that  the  application  will  keep  a  local  cache 
of  Mapzen  tiles  that  it's  used  to  display  maps  and  once  a  map  has 
loaded  for  a  specific  place  (a  neighbourhood,  a  venue,  etc.)  then  it 
will  ask  tangr am .  j  s  to  take  a 

screenshot  https : / / github . com/ tangrams / tangram/blob/ ca7  9  8 
f 7  97  2ddf aa2  0997b2  6  If 07cbf d5 15caf 165/ src/ scene . js#L12 12- 
l  1 2 1 8  of  the  map  and  store  it  on 

disk  https : / / github . com/whosonf irst/electron-whosonf irst- 
bookmarks/blob/master/ javascript/mapzen . whosonf irst . bookm 
arks . screenshots . j  s 


The  next  time  you  visit  that  place  in  the  application  the  first  that  will 
happen  is  the  the  screenshot  will  be  loaded  underneath  the  map.  If 
you're  online  or  if  you  have  cached  map  tiles  the  screenshot  will  be 
covered  up  but  if  not  you  still  have  a  visual  representation  of  the 
place  you're  "looking"  at. 


It  doesn't  work  offline  when  it  comes  to  adding  new  things.  As 
mentioned  search  doesn't  work  but  even  if  you  happen  to  know  the 
Who's  On  First  ID  of  the  place  you're  trying  to  add  the  application 
needs  to  fetch  the  details  for  that  place  over  the  wire. 

Now  that  we've  started  building  bundled  SQLite  databases  of 
Who's  On  First  data  /downioad/#sqiite  I  can  imagine  a 
scenario  where  you  download  one  or  more  "bundles"  specific  to  your 
interests,  say  just  the  venues  in  one  or  two  cities,  and  those  are  used 
for  lookups  if  the  API  isn't  available. 


The  other  notable  place  where  "Privatezen"  doesn't  work  is  adding 
new  venues  that  don't  already  exist  in  Who's  On  First. 


I  100%  agree  with  everyone  who  thinks  it  would  be  really  useful  for 
people  to  have  a  mechanism  that  allows  them  to  create  new  places  in 
the  application,  even  if  they're  only  stored  locally. 


You  might  even  think  that  should  have  been  the  first  thing  I  worked 
and  maybe  I  should  have.  This  is  the  part  where  I  look  at  my  shoes 
and  mutter  something  about  "mornings  and  weekends"... 


I  think  I  punted  on  it  because  it  starts  to  get  complicated  pretty 
quickly  between  having  to  generate  and  store  local  IDs  and  then  to 
reconcile  them  with  Who's  On  First  IDs,  not  to  mention  the  lack  of 


an  API  or  even  a  simple  way  to  submit  a  pull  request  against  the 
data  Stored  in  GitHub  https  :  / /WWW.  github .  com/whosonf  irst- 
data  . 


These  are  all  things  that  need  to  be  done  but  here's  me  looking  at  my 
shoes  and  muttering  something  about  "mornings  and  weekends"... 


Nor  are  there  any  "clients"  beyond  the  desktop  application.  All  the 
fancy  talk  about  encrypting  and  decrypting  SQLite  databases  over 
the  network  is  just  fancy  talk  at  the  moment.  There  are  the 
beginnings  of  a  Cordova-based 

application  https  :  //github .  com/whosonf irst/cordova- 
whosonf  irst-bookmarks  to  do  just  that  but  so  far  it's  mostly  just 
been  a  reminder  of  why  programming  native/non-web  applications 
is...  challenging. 
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As  mentioned  the  desktop  application  is  an  Electron  application  so 
it's  all  written  in  JavaScript  and  CSS  and  a  tiny  amount  of  HTML. 

It's  mostly  a  lot  of  boilerplate  JavaScript. 

I  think  you're  supposed  to  use  something  fancy  like 
React  https://github.com/facebook/react  to  build 
applications  like  this,  these  days,  and  if  I  had  to  do  this  sort  of  thing 
40-hours  a  week  I  probably  would  have  invented  something  like 
React  too.  Instead  it's  just  a  lot  of  JavaScript  libraries  that  load  and 
call  one  another  and,  aside  from  the  part  where  "Privatezen"  works,  I 
am  willing  to  accept  that  I'm  "doing  it  wrong" . 

At  one  point,  building  "Privatezen"  became  an  exercise  in  trying  to 

refactor  or  write  all  of  the  Who's  On  First  Javascript 

libraries  https : //github.com/whosonf irst/ js-mapzen- 

whosonf  irst  to  work  in  both  a  browser  context  and  a 

nodejs  https :  /  /node  js  .org/  context.  They  don't  yet,  or  at  least 

not  all  of  them,  and  I  doubt  any  of  them  do  it  well.  But  that  is  the 

goal. 


I've  been  referring  to  the  project  as  "Privatezen"  in  this  blog  post  but 
the  actual  code  is  stored  in  a  repository  with  the  very  dull  name  of: 


https  ://github  .com/whosonfir  st/ electron- whosonfirst- 
bookmarks  https  :  / / github .  com/whosonf  irst/electron- 


whosonf irst-bookmarks 


There  are  build  instructions  for  the  applications  (eventually  there 
will  OS-specific  binary  versions)  but  that's  about  it  as  far  as 
documentation  goes  right  now.  Honestly,  this  blog  post  will  probably 
serve  as  the  best  piece  of  documentation  about  the  project  for  the 
time  being. 

There  is  also  the  awkward  reality  of  Mapzen  itself  shutting 
down  https : / /whosonf irst.org/blog/2018/ 01/ 02/chapter- 
two/  along  with  all  of  the  services  the  application  uses,  including 
the  Who's  On  First  API.  That  was  never  part  of  the  plan. 

I  actually  don't  think  it  will  be  a  big  deal  between  the  Nextzen  tile 
release  https : / /mapzen. com/blog/long-term-support-mapzen- 
maps/  and  Mapzen  Search  blossoming  into 

geocode  .earth  https : / /mapzen. com/blog/vl-searching-f or- 
the-road-ahead/  and  the  Who's  On  First  API  eventually  re¬ 
surfacing  http: / /localhost: 8080/blog/2 018 /O 1/02 /chapter- 
two/  (and  I  promise  it  will  if  only  because  I  need  it). 


It  might  be  nothing  more  complicated  than  swapping  out  endpoints 
and  API  keys  for  the  different  services  and  wouldn't  that  be  a  nice 
way  to  dull  the  pain  of  Mapzen  shutting  down? 


A  few  final  words  about  #feelings. 


At  State  of  the  Map  US,  last  year,  I  told  the 

Story  https : / /www. aaronland . inf o/weblog/2 017/10/24 /things 
/#sotm-us  of  the  minor  freak  out  I  had  during  a  meeting,  early  on 
at  Mapzen,  and  asking  people  to  accept  that  there  were  no  open 
venue  datasets  for  us  to  piggy  back  off  of.  The  consequence  of  that 
fact  is  that  a  lot  of  our  work  would  be  trying  to  find  new  ways  to 
have  people  contribute  data,  a  lot  of  it  data  that  many  people  had 
already  gone  to  the  trouble  of  adding  to  services  like  Foursquare. 


The  gem  that  fell  out  of  that  meeting,  for  me,  was  an  idea  for  a  brain- 
dead  simple  mobile  application  that  allowed  people  to  talk  about 
what  kind  of  place  you  were  in  and  what  sort  of  thing  you  were 
doing,  rather  than  a  hyper- specific  narrative. 

What  if  instead  of  saying  you  were  at  a  specific  place  with  a  specific 
address,  which  we  didn't  know  about  and  which  you  didn't  want  to 
waste  time  pecking  in  to  a  mobile  UI,  you  could  just  record  that  you 
were  "at  a  bar"  ?  And  what  if  the  way  to  add  meaning  and  nuance  to 
that  fact  was  to  add  a  "feeling"  to  the  signal. 

For  example,  what  if  you  could  say  "I'm  at  a  bar,  crying"  or  "I'm  in  a 
telco,  hulking  out".  The  list  goes  on.  Those  two  statements  (category 
plus  emotion)  combined  with  a  latitude  and  longitude,  multiplied  by 
a  lot  of  people,  start  to  fill  in  a  lot  of  gaps. 


It  was  meant  to  be  something  you  could  do  in  the  moment,  ideally 
with  nothing  more  than  your  thumb  and  without  making  your  friends 
feel  like  you  were  more  interested  in  your  phone  than  you  were  in 


them.  It  was  meant  to  be  something  that  could  convey  just  enough 
information  in  the  moment,  with  enough  wiggle  room  for  people  to 
play  with. 


Ideally  we  would  have  wanted  for  people  to  go  back  and  fill  in  the 
names  and  addresses  for  all  the  "bars"  and 
"restaurants"  https : / / github . com/whosonf irst/maplibs- 
meta/blob/master /data/buckets . json  where  they  were 

"drinking"  and 

"crying"  https : / / github . com/whosonf irst/maplibs- 
meta/blob/master/data/ feels  .  j  son  but  even  if  they  didn't  it 
would  be  enough  signal  for  us  to  start  working  with  and  using  to 
prioritize  future- work. 


{ 

" feels "  :  [ 

{ 

"label":  "being  the  decider", 
"label_clean" :  "being-the-decider " , 
"id":  135726611 

>, 

{ 

"label":  "crying", 

" label_clean" :  "crying", 

"id":  102681251 

}, 

{ 

"label":  "drinking", 

" label_clean" :  "drinking", 

"id":  102681341 

}, 

{ 

"label":  "eating", 

" label_clean" :  "eating", 

"id":  135877017 

}, 

{ 

"label" : 

"incentivizing" , 

"label_clean" :  "incentivizing" , 
"id":  135726609 


. . .  and  so  on. . . 


Sadly,  it's  not  a  project  that  ever  went  anywhere.  The  database 
schema  that  I  scribbled  out  on  the  whiteboard  that  afternoon  sat  there 
untouched  for  1 8  months  (maybe  more)  but  point  of  telling  this  story 
is  not  to  bemoan  our  inability  to  ship  the  simplest  and  dumbest  of 
user-facing  applications.  Instead  it  is  to  celebrate  the  silliness,  and 
the  usefulness  in  the  silliness,  of  the  framing  devices  we  imagined 
for  what  was  little  more  than  a  "check-in"  application. 

On  the  surface  recording  a  lot  of  people  "crying,  in  bars"  or 
"monetizing  like  crazy,  in  offices"  doesn't  seem  like  it  would  be 
useful  but  I  don't  buy  that  argument. 


Recent 

ID 

Latitude 

Longitude 

Someone  was  in  a 

And  they  were 

On 

17 

40.729596 

-73.986918 

Bar 

monetizing  like  crazy 

1438642577 

15 

40.744554 

-73.990304 

Office 

monetizing  like  crazy 

1438641220 

13 

37.775744 

-122.413614 

Office 

thought-leadering 

1438640928 

11 

37.775725 

-122.413671 

Spinny  Bar 

monetizing  like  crazy 

1438639987 

9 

37.775717 

-122.413627 

Hot  Dog  Joint 

thought-leadering 

1438639620 

7 

37.775727 

-122.413936 

Volleyball  Court 

crying 

1438638937 

5 

37.775744 

-122.413614 

Bike  Shop 

drinking 

1438638418 

I  remain  convinced  that  if  we'd  be  been  able  to  devote  a  little  more 
time  than  we  had  to  polishing  the  rough  edges  off  of  this  idea  we 
could  have  not  only  made  short  work  of  the  "no  open  data  venues" 


problem  but  also  had  fun  doing  it  at  the  same  time.  Fun,  it  turns  out 
is...  well,  fun. 


It's  something  I  try  to  remember  while  building  "Privatezen"  and 
why  the  application  has  #feelings. 


2018-02-02 
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WOF  In  a  Box  (part  3) 


WOF  In  a  Box  (part  3) 


This  was  originally  published  on  the  Who's  On  First 

weblog  https://whosonfirst.org/biog/20i8/02/20/wof-in-a-box-part3/  ,  in  February  2018 . 


The  Who's  On  First  Spelunker  lives  again!  Its  new  home  is: 


https://spelunker.wh0s0nflrst.0rg/  https  :  //spelunker . whosonf  irst .  org/ 


Any  old  Spelunker  URLs  should  continue  to  work  as-is  once  you  update  the  root, 
replacing  whosonf  irst  .mapzen . com/ spelunker  with 
spelunker.whosonfirst.org.  For  example,  the  URL  for  the  Who's  On  First 
record  for  Tokyo  https : //spelunker .whosonf irst . org/ id/ 102 031307/  used 
to  be: 


https://whosonflrst.mapzen.com/spelunker/id/102031307/  # 

And  now  it  is: 

https://spelunker.whosonflrst.org/id/102031307/  https : //spelunker .whosonf i 

rst.org/id/ 102 03 1307/ 


Yay! 


The  rest  of  this  blog  post  is  divided  up  in  to  two  parts,  one  nerdier  than  the  next. 
The  first  section  describes  a  couple  of  mechanical  changes  we  made  to  speed  up 
rebuilding  the  Spelunker  ("part  3"  of  WOF  in  a  Box). 


In  my  mind  everything  up  to  and  including  the  Spelunker  is  “near- 
term",  the  API  and  the  spatial  services  are  “medium-term”  and  the 
editorial  stuff  is  “longer-term".  It’s  not  ideal  but  it  seems  the  most 
realistic  given  whatever  world  of  new  everyone  involved  in  the  project 
will  be  negotiating  during  the  coming  year. 


The  second  section  builds  on  the  first  and  discusses  what  we've  been  up  to  post- 
Mapzen  https : //www. aaronland. info/weblog/2017/12/31/things/#chapter- 
two  and  where  things  are  going  next.  This  is  the  "medium-term"  work  outlined  in 
the  blog  post  that  followed  the 


announcement  https : //WWW. aaronland. info/weblog/2017/12/31/things/#ch 
apter-two  that  Mapzen  was  shutting  down. 

It's  pretty  technical  so  if  that's  not  your  jam  you  can  skip  it  all  in  good  conscience 
and  head  over  the  to  the  Spelunker  https://spelunker.whosonfirst.org/ 
and  start  spelunking  away  again. 


The  Spelunker  was  rebuilt  on  a  bare  Ubuntu 

16.04  https://wiki.ubuntu.com/XenialXerus/ReleaseNotes  Linux  server, 
following  Dan's  WOF  in  a 

Box  https : //whosonfirst.org/blog/2017/12/21/wof-in-a-box/  instructions 
and  everything  worked  without  a  hitch.  Along  the  way,  I  made  some  updates  to  the 
"fetching  and  indexing  data"  pieces  specifically  to  make  things  faster  and  easier  for 
people  who  just  want  to  work  with  the  data  as-is  and  don't  need  to  make  updates. 


These  updates  actually  introduced  some  " hitches "  in  Dan's  guide  but  with  Gary 
Gale  https :  / /www .  vicchi .  org/  '5  help  we  were  able  to  make  short  work  of 
those  bugs  and  everything  should  work  as  advertised  again... 


Historically  all  the  Who's  On  First  tools  have  been  designed  to  work  with  the  raw 

(plain-text  GeoJSON) 

data  http : //www. aaronland. inf o/weblog/2 0 16/08/ 15/things /#database 
contained  in  the  GitHub  repositories.  It's  important  that  there  always  be  tooling  to 
work  with  the  raw  "at  rest"  data  but  it  can  be  unnecessarily  fiddly  for  a  lot  of  people. 


We've  been  doing  a  lot  of  work  recently  to  distribute  Who's  On  First  data  as  SQLite 
databases  https :  //sqiite .  org/  .  SQLite  databases  have  the  advantage  of  being 
self-contained  with  widespread  support  in  a  variety  of  tools  and  programming 
languages.  They  don't  currently  include  alternate 
geometry  https : //github . com/whosonf irst/whosonf irst- 
cookbook/blob/master/how_to/creating_alt_geometries  .md  WOF  records 
but  those  aren't  necessary  for  most  of  the  things  that  people  want  to  do  with  WOF 
related  tools. 


The  first  tool  that  we  wrote  is  a  simple  command-line  tool  to  fetch  all  of  the 
databases  defined  in  an  inventory .  j  son  file.  These  inventory .  j  son  files 
are  still  a  work  in 

progress  https://dist.whosonfirst.org/sqlite/inventory.json  and 
currently  only  published  for  SQLite  databases.  As  we  chip  away  at  the  remaining 
work  to  generate  other  Who's  On  First 

distributions  https://whosonfirst.org/downioad/#distributions  the  plan 
is  to  adopt  and  adapt  the  inventory  files  accordingly.  More  on  that  below. 

Here's  how  to  the  wof-dist-fetch  https  :  //github. com/whosonfirst/go- 
whosonfirst-dist#wof-di st- fetch  tool  works.  In  this  example  we're  going  to 


ask  the  tool  to  fetch  every  database  listed  in  the  inventory  file,  and  store  them  in  a 
folder  called  /usr/local/data: 


>  wof-dist-fetch  \ 

-inventory  https://dist.whosonfirst.org/sqlite/inventory.json  \ 
-dest  /usr/local/data 
. . .time  passes . . . 

>  Is  -d  /usr/local/data/whosonf irst-data* 

/usr/local/data/whosonf irst-data-constituency-ca . db 
/usr/local/data/whosonf irst-data-constituency-ca-latest . db 
/usr/local/data/whosonf irst-data-constituency-us-latest . db 

/usr/local/data/whosonf irst-data-venue-us-wy-latest . db 
/usr/local/data/whosonf irst-data-venue-uy-latest . db 
/usr/local/data/whosonf irst-data-venue-za-latest . db 


You  can  also  filter  specific  databases  by  passing  one  or  more  -include  or  - 
exclude  flags.  For  example  if  we  only  wanted  to  download  postal  code  databases 
we  would  do  this: 


>  wof-dist-fetch  \ 

-inventory  https://dist.whosonfirst.org/sqlite/inventory.json  \ 

-dest  /usr/local/data  \ 

-include  ' whosonf irst-data-postalcode-*-latest . db ' 

The  wof-dist-fetch  tool  replaces  the  steps  described  in  the  Download  some 
data  https : //whosonf ir st . org/blog/2 0 17/ 12 /2 l/wof-in-a-box/#download 
section  of  Dan's  post.  It  will  still  take  a  while  to  download  all  the  SQLite  databases 
but  they  are  generally  smaller  and  easier  to  work  with  than  the  Git  repositories  and 
don't  require  setting  up  additional  tools  like  Git 

LFS  https : //github . com/whosonf irst-data/whosonfirst-data#git-and- 
large-files 


The  wof-dist-fetch  tool  is  part  of  the  go-whosonfirst- 

dist  https : // github . com/whosonf irst/ go-whosonf irst-dist  package  which 
will  also  be  used  to  generate  those  SQLite  files  and  other  distributions.  The  package 
contains  different  tools  to  fetch  and  build  distributions  but  the  latter  are  still  being 
actively  developed  so  you  probably  shouldn't  try  using  those  yet,  unless  you  are 
feeling  adventurous. 


The  second  tool  is  actually  just  an  update  to  one  of  the  very  first  tools  we  ever 
wrote.  The  wof-es-index  https://github.com/whosonfirst/py-mapzen- 
whosonf  ir st- sear ch#wof-es- index  tool  is  used  to  crawl  a  directory  full  of 
Who's  On  First  GeoJSON  data  files  and  index  them  in  the  Who's  On  First  Spelunker 
Elasticsearch  index  https : // github . com/whosonf irst/es-whosonf irst- 
schema  .  It  has  been  updated  to  also  read  and  index  WOF  data  stored  in  the  SQLite 
databases  we're  generating.  In  Dan's  original  blog 

post  https : //whosonfirst.org/blog/2017/12/21/wof-in-a-box/#index  you 
would  index  the  Spelunker  like  this: 


>  cd  /usr/local/data/whosonf irst-data 

>  wof-es-index  -s  .  — index=spelunker  -b 


And  now  you  can  do  this: 


>  wof-es-index  -b  -m  sqlite  /usr/local/data/whosonf irst-data-*-latest .db 

The  -s  ( ource )  flag  has  been  deprecated  and  replaced  with  the  -m(  ode )  flag 
which  can  index  a  number  of  different  data 

sources  https : // github.com/whosonfirst/py-mapzen-whosonfirst-index  : 
Git  repositories,  SQLite  databases,  one  or  more  GeoJSON  files  and  so  on.  It  is  also 
no  longer  necessary  to  pass  the  -i  ( ndex )  flag  since  it  defaults  to  "spelunker" 
now.  The  trusty  -b  ( ulk )  flags  remains  unchanged  for  speeding  up  indexing. 


So,  it's  only  a  pair  of  small  tweaks  to  Dan's  instructions  but  hopefully  they  are 
useful  ones. 


There's  been  a  lot  of  work  on  the  SQLite  databases  leading  up  to  those  tweaks 
including  adding  full-text  and  spatial  indexing  as  well  as  refactoring  the  code  in  to 
re-usable  components  so  that  it  can  be  used  for  other  kinds  of  data,  specifically 
Who's  On  First  brands  https://github.com/whosonfirst-data/whosonfirst- 
brands  and  Markdown  files. 


In  all  there  are  three  distinct  packages,  two  of  them  for  Who's  On  First  features  and 
brands  and  another  for  generic  Markdown  files,  and  a  fourth  shared  by  all  the 
others: 


https://github.com/whosonfirst/go-whosonfirst- 
sqlite/tinterfaces  https  :  //github.com/whosonfirst/go- 
whosonf irst-sqlite#inter faces 


https://github.com/whosonfirst/go-whosonfirst-sqlite- 

featlires  https : //github. com/whosonfirst/go-whosonfirst- 
sqlite- features 


•  https://github.com/whosonfirst/go-whosonfirst-sqlite- 

brands  https : // github . com/whosonf irst/go-whosonfirst- 
sqlite-brands 


•  https://github.com/whosonfirst/go-whosonfirst-sqlite- 

markdown  https : //github . com/whosonf irst/ go-whosonf irst- 
sqlite-markdown 


There  is  even  an  experimental  go-whosonfirst-sqlite  package  for  working 
with 

Privatezen  https : //www. aaronland. inf o/weblog/2 0 18/02 / 02 /things /#priva 
tezen  databases  but  it's  still  too  soon  to  discuss  that  yet. 

The  support  for  Markdown  files  is  not  as  much  of  a  non-sequitur  as  it  might  seem. 
Part  of  living  in  a  post-Mapzen 

world  https : //www. aaronland. inf o/weblog/2 017/ 12 /31/things/#chapter- 
two  has  involved  moving  all  the  Who's  On  First  blog  posts  (from  the  Mapzen 
website)  over  here  https :  //whosonfirst.org/biog  .  That's  meant  writing  our 
Own  suite  of  tools  https  :  //github .  com/whosonf  irst/ go-whosonf irst- 
markdown  to  render  the  Jekyll-flavoured  Markdown  files  that  we've  always  used 
for  blog  posts. 

Last  summer  we  launched  "  version 

2"  https : //whosonfirst.org/blog/2017/07/28/wof-website-redesign/  of 

the  Who's  On  First  website,  the  result  of  Scott 

Dombrowski  http://scottdombkowski.com/  's  internship  with  Mapzen.  It  has 


been  a  huge  UI  and  UX  improvement  and  helped  corral  all  the  many  different 
sources  of  documentation  in  to  one  place. 


On  the  other  hand  the  website  still  requires  more  work  than  it  should  in  order  to 
update  or  add  new  pages  so  "version  3"  will  essentially  be  Markdown  files  written 
by  hand  or  derived  from  machine  readable  data  that  will  be  rendered  as  HTML  with 
the  same  tools  that  I  wrote  for  the  blog. 


The  value  of  being  able  to  index  Markdown  files  in  SQLite  is  that  we  are  now  able 
to  do  full-text  search,  first  for  the  blog  but  ultimately  for  all  the  WOF  related 
documentation,  once  the  "version  3" 

work  https : // github .com/whosonfirst/whosonf irst-www/milestone/3  is 
completed: 


>  . /bin/wof-sqlite-query-markdown  -dsn  test.db  montreal 
15:49:12.499695  [wof-sqlite-query-markdown ]  STATUS  montreal 
15:49:12.499856  [wof-sqlite-query-markdown]  STATUS  montreal 
15:49:12.499906  [wof-sqlite-query-markdown]  STATUS  montreal 
15:49:12.499936  [wof-sqlite-query-markdown]  STATUS  montreal 
15:49:12.500024  [wof-sqlite-query-markdown]  STATUS  montreal 


/blog/2015/08/ 18/who-s-on-f irst/ 

/blog/ 2 0 17 /04/04/whosonf irst-api/ 
/blog/ 2 017/ 10/ 17 /whosonf irst-nacis-201 
/blog/ 2  017/12/2  2 /neighbourhood-updates 
/blog/2015/08/ 18/who-s-on-f irst/ 


Careful  readers  will  note  that  it's  still  not  possible  to  search  the  weblog  on  the 
weblog  itself.  It  will  become  possible,  I  promise,  just  not  today... 


We  used  the  blog  as  a  testing  and  proving  ground  for  how  full-text  search  should 
work  and  then  applied  those  lessons  to  the  tools  we  use  to  index  and  query 
brands  https : // github.com/whosonfirst/ go-whosonf irst-sqlite- 
brands#tools  : 


>  . /bin/wof-sqlite-query-brands  -dsn  test.db  'car*  bank' 

17:23:40.459620  [wof-sqlite-query-brands ]  STATUS  car*  bank  -  1125154403  Carolina  First  Bank 
17:23:40.459746  [wof-sqlite-query-brands]  STATUS  car*  bank  -  1125153083  Central  Carolina  Bank 


And  then  finally  for  actual  Who's  On  First 

documents  https : // github . com/whosonf irst/ go-whosonf irst-sqlite- 
features#wof-sqlite-query- features  : 


>  . /bin/wof-sqlite-query-features  -dsn  test2.db  JFK 
102534365 , John  F  Kennedy  Int ' 1  Airport 


>  ./bin/wof-sqlite-query-features  -dsn  test2.db  -column  names_colloquial  Paris 

85922583, San  Francisco 

102027181, Shanghai 

102030585, Kolkata 

101751929,  Troinso 


We've  also  added  the  ability  to  index  and  query  Who's  On  First  geometries  for 
people  who  have  the  [Spatialite](https://www.gaia-gis.it/fossil/libspatialite/index) 
extension  installed  on  their  computers . 


>  . /bin/wof-sqlite-index-features  \ 

-timings  -live-hard-die-fast  -spr  -geometries  \ 

-driver  spatialite  -mode  repo  \ 

-dsn  test.db  \ 

/usr/ local /data/whosonf irst-data-constituency-ca/ 

10:09:46.534281  [wof-sqlite-index-f eatures ]  STATUS  time  to  index  geometries  (87)  :  21.251828704s 

10:09:46.534379  [wof-sqlite-index-f eatures ]  STATUS  time  to  index  spr  (87)  :  3.206930799s 

10:09:46.534385  [wof-sqlite-index-f eatures ]  STATUS  time  to  index  all  (87)  :  24.48004637s 

>  sqlite3  test.db 

SQLite  version  3.21.0  2017-10-24  18:55:49 
Enter  ".help"  for  usage  hints. 

sqlite>  SELECT  load_extension ( ' mod_spatialite . dylib ' ) ; 
sqlite>  SELECT  s.id,  s.name  FROM  spr  s,  geometries  g 

WHERE  ST_Intersects ( g . geom,  GeomFromText (' POINT ( -122 . 229137  49.450129)',  4326)) 

AND  g . id  =  s.id; 

1108962831 | Maple  Ridge-Pitt  Meadows 


Now  that  we've  sorted  out  how  to  index  and  query  spatial  properties  in  the  SQLite 
databases  we've  also  begun  work  on  a  spatialite  branch  of  the  Who's  On  First 
point-in-polygon  (PIP)  server  https  : //github.com/whosonfirst/go- 
whosonf irst-pip-v2 /tree /spatialite  . 


The  idea  is  to  speed  up  indexing  time  and  memory  usage  (and  maybe  even  query 
time)  by  teaching  the  code  to  use  the  newer  SQLite  databases  instead  of  an  in¬ 
memory 

RTree  http : //www. aaronland. info/weblog/ 2 016/02/ 19 /things /#iamhere  . 
Our  hope  is  people  can  simply  download  one  or  more  of  the  SQLite  databases  using 
the  wof-dist-f  etch  tool  described  above,  point  the  PIP  server  at  those 
databases  and  having  a  working  services  in  a  matter  of  minutes. 


The  other  hope  is  that  these  SQLite  databases  can  be  integrated  with  existing  Who's 
On  First  applications,  like  the  Spelunker  or  the 

API  https : //www. aaronland. info/weblog/2  017/04/04/things/#api  ,  to 

provide  an  abbreviated  set  of  features  -  a  sort  of  "Who's  On  First  Lite"  -  for  people 
who  aren't  in  a  position  to  set  up  more  complicated  databases  like  Elastic  search. 


There's  still  some  heavy-lifting  and  hoop-jumping  to  complete  before  any  of  this  is 
possible  but  that  is  the  goal. 


There  are  three  reasons  we've  been  spending  so  much  time  on  the  SQLite  databases: 


1.  They  are  super  cool  and  super  useful  in  their  own  right. 

2.  Life  has  been  mostly  interupt-driven  since  the  Mapzen  shutdown  and 
hasn't  lent  itself  to  more  concentrated  work. 

3. 1  have  a  hunch  that  we  can  use  these  SQLite  databases  to  more 
effeciently  update  all  the  other  tools  and  services  whether  it's  the 
Spelunker,  the  API,  the  raw 

data.wllOSOnfirst.org  https://data.whosonfirst.org  files  Or 
Other  bulk  distributions  https  :  //whosonf irst.org/download/ 
that  people  might  want  to  download. 

But  that  means  building  those  SQLite  databases  first  and  doing  so  in  a  timely  and 
automated  fashion  when  changes  are  pushed  to  the  actual  Who's  On  First 
data  https://github.com/whosonfirst-data  .  That's  where  we're  at  today: 
Working  through  those  details  and  figuring  out  how  to  improve  on  the  bubble-gum 
and  duct  tape  solutions  https://github.com/whosonfirst/go-whosonfirst- 
updated#go-whosonf  irst-updated  that  got  US  this  far. 


The  goal  for  the  next  phase  of  work  is  to  set  up  a  reliable  workflow  for  generating 
both  bundles  https://whosonfirst.org//downioad/#bundies  and 
SQLite  https://whosonfirst.org//downioad/#sqiite  databases  for  each  of 
the  whosonfirst-data  https://github.com/whosonfirst-data  repositories, 
along  with  a  standardized  "inventory"  file  for  both  formats  that  can  be  consumed  by 
the  wof-dist-f  etch  tool,  and  then  to  use  those  distributions  to  update  the 
Spelunker  https://spelunker.whosonfirst.org/  and  the 
places.whosonfirst.org  https://piaces.whosonfirst.org/  endpoint. 


If  you're  curious  you  can  follow  along  in  the 

Sqlite  https : // github . com/whosonf irst/ go-whosonf irst- 


dist/blob/sqlite/cmd/wof-dist-build. go  and 
bundles  https : //github . com/whosonf irst/go-whosonf irst- 
dist/blob/bundles/cmd/wof-dist-build.  go  branches  of  the  go-whosonfirst- 
dist  https : //github . com/whosonfirst/ go-whosonf irst-dist  package. 

Once  that's  working  then  we'll  turn  our  attention  to  spinning  up  the  Who's  On  First 
API  https : //www. aaronland. inf o/weblog/2 0 17/04 / 04 /things /#api  again. 
No  one  is  working  on  this  "40  hours  a  week"  anymore  so  it  may  take  a  bit  longer 
than  we'd  like  but  hopefully  not  too  much  longer. 

In  the  meantime,  the  Spelunker  is  back!  https :  //speiunker. whosonfirst.org/ 

2018-02-20 
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The  Why  of  the  How 


This  was  originally  published  on  the  Who's  On  First 

weblog  https://whosonfirst.org/blog/2  018/02/27/why-of-the-how/  ,  in 

February  2018. 


This  is  a  bit  of  an  accidental  blog  post. 


I  mentioned  Gary  Gale  https://vicchi.org/  briefly  in  the  last 
blog 

post  https : / /www. aaronland. info/weblog/2018/ 02/20 /things/ 
#wof -in-a-box  .  Gary  and  I  go  back  to  Flickr  /  YlGeo  /  GeoPlanet 
( nee  Where  On  Earth ) 

days  https : / /www. aaronland . inf o/weblog/2009/12/2 1/redacte 
d/#woeir  and  lately  he's  been  spending  some  time  poking  at  all  the 
tools  and  data  that  make  up  Who's  On  First. 


What  I  love  about  Gary  is  that  he  has  the  diligence  and  the  patience 
to  find  all  the  outstanding  questions  and  gotchas  in  a  project.  Gary 
has  been  setting  up  a  local  instance  of  the 

Spelunker  https://spelunker.whosonfirst.org/  and  he  sent 
me  a  long  list  of  questions  about  how  the  Spelunker  does  and  doesn't 
work  with  newer  versions  of 

Elasticsearch  https  :  / /www. elastic  .  co/ products /elastic sea 
rch  .  The  gist  of  his  email  being: 


The  WOF  Spelunker  is  currently  based  on  ES  2.4.6 
and  there’s  been  a  lot  of  changes  that  Eve  had  to 
accommodate 


I  still  can’t  get  the  punctuation  filter  working 
properly.  Creating  a  new  index  with  this  in  the 
schema  dies  with  this  ... 


One  thing  that’s  caused  significant  headaches  is  that 
there’s  a  lot  of  type  inconsistencies  in  the  WOF 
GeoJSON,  with  fields  that  /should/  be  numeric  being 
specified  as  strings.  As  I’m  bulk  indexing,  1000  at  a 
time,  there’s  no  guaranteed  order  (or  shard 
guarantee)  for  the  documents  to  be  processed  in  so  I 
was  getting  a  lot  of  mapping  mismatches. 


For  now  I’ve  fixed  this  by  coercing  the  problematic 
fields  into  their  integer  values  so  the  mappings  are 
dynamically  created  consistently.  Specifically  this 
bites  when  processing  the  properties .wk: count, 
properties. qs fid  and 

properties.wof:concordances.qs:id  fields.  Also  oddly, 
properties. qs  fid  always  seems  to  be  zero  whereas 
propertie s . wof : concordances  .qs :  id  isn ’  t ? 


I  did  check  in 

https://github.com/whosonfirst/whosonfirst-json- 
SChema  https : / / github . com/whosonf irst/whoso 
nf irst- json-schema  but  it  looks  like  these 


documents  only  cover  the  minimum  viable  WOF 
schema  and  not  all  the  fields  available.  Either  that  or 
it’s  just  out  of  date. 


I  have  included  an  abbreviated  version  of  my  reply  below  because 
aside  from  answering  some  of  Gary's  immediate  questions  it  ended 
up  touching  on  a  lot  of  other  interelated  decisions  and  trade-offs  in 
both  the  design  and  the  implementation  of  Who's  On  First  that 
probably  haven't  been  discussed  or  documented  as  well  as  they 
should  be. 


One  of  the  things  I've  taken  to  saying  in  recent  years  is  that: 
"Sometimes  we  make  mistakes  because  of  circumstance  and 
sometimes  we  make  bad  decisions  because  of  reasons...  so  please 
just  write  those  reasons  down  somewhere." 


I'd  like  to  believe  we've  done  a  pretty  good  job  of  that  in  the  Who's 
On  First  codebase  itself  but  in  as  much  as  a  large  number  of  people 
are  never  going  to  read  those  comments  it  seems  like  it  a  useful 
practice  to  do  pursue,  here  on  the  blog,  as  well.  If  other  people  have 
similar  questions  we'd  be  happy  to  answer 

them  https://www.twitter.com/alloftheplaces/  and  maybe 
we  will  start  a  general  "why  of  the  how"  series  of  posts. 


In  the  meantime,  here's  what  I  said  to  Gary: 


Did  I  mention  I  have  #feelings  about  Elasticsearch  (ES)  ? 


Everything  you've  just  described  is  one  reason  we're  still  using  2.4.  It 
was  enough  of  a  time- sink  just  migrating  from  1  .x  to  2.x  that  there's 
never  been  a  compelling  reason  to  upgrade  again  beyond  "3  (or  5  in 
the  case  of  Elasticsearch,  which  skipped  versions  3  and  4)  is  bigger 
than  2". 

Worse,  I  began  to  have  little  confidence  that  updating  from  version  x 
to  version  y  wasn't  just  going  to  be  time  wasted  because  I  soon  as  I 
was  done  I  would  have  to  upgrade  to  version  z . 

I  realize  this  is  pretty  much  the  defining  characteristic  of  "modern 
software  development"  but  that  doesn't  make  it  right. 


I  suppose  if  Elastic  are  already  talking  about  "version  8"  it  might  be 
Worth  the  effort  https : / / readwrite . com/2  0 1 1/02/ 10/pinboard- 
creator-macie  j-ceglow/  but  it's  hard  not  to  feel  grumpy  about  it 
all.  Anyway,  all  the  ES  stuff  is  kept  here: 


•  https://github.com/whosonfirst/es-whosonfirst- 
SChema  https : / / github . com/whosonf irst/es- 
whosonfir st- schema 


The  first  thing  I  would  suggest  are  branches  and  PRs.  Even  just 
creating  a  milestone  and  outlining  all  the  migration  issues  would  be 
useful.  All  of  the  code  to  index  ES  is  kept  here: 

•  https://github.com/whosonfirst/py-mapzen- 
whosonfirst- 

searchlbloblmasterlmapzenlwhosonfirstl  search/ _ init_ 

_.py  https : // github . com/whosonf irst/py-mapzen- 
whosonf irst- 

search/blob/master/mapzen/whosonf irst/ search/ _ 

init _ . py 


And  here,  which  is  just  a  thin  non-WOF/Spelunker-specific  base 
class  which  wraps  the  handful  of  HTTP  requests  we  make  to  ES, 
because  every  ES  wrapper  library  out  there  tries  to  do  all  the  things 
and  quickly  becomes  more  trouble  than  it's  worth: 

•  https://github.com/whosonfirst/py-mapzen- 
whosonfirst- 

elasticsearch/blob/master/mapzen/whosonfirst/ elastics 
earch! _ init _ .py  https  :  /  /github. com/whosonf  irst 


/ py-mapzen-whosonf irst- 

elasticsearch/blob/master/mapzen/whosonf irst/ el 
asticsearch/ _ init _ . py 


In  the  beginning  we  relied  on  the  supposed  "schemaless  magic"  of 
ES  to  index  heterogeneous  documents  but  very  quickly  bumped  in  to 
the  thing  where  ES  tries  to  be  clever  and  then  shoots  itself  in  the  face 
when  it  comes  to  strings  versus  ints  and  of  course  EDTF  date 
strings  /blog/2017/ 06/29/tackling-space-and-time-in- 
whosonf  irst/  .  Because  there  was  never  time  to  go  on  a  prolonged 
ES  vision  quest  I  just  handled  it  all  in  code. 

•  https://github.com/whosonfirst/py-mapzen- 
whosonfirst- 

search/blob/master/mapzen/whosonfirst/ search/ _ init_ 

_.py#L62- 

L480  https : / / github . com/whosonf irst /py-mapzen- 
whosonf  irst- 

search/blob/master/mapzen/whosonf irst/ search/ _ 

init _ . py#L62-L480 


•  https://github.com/whosonfirst/py-mapzen- 
whosonfirst-search/blob/master/scripts/wof-es- 
prepare  https : / / github.com/whosonfirst/py- 
mapzen-whosonf irst- 

search/blob/master/ scripts /wof-es -prepare 


Eventually  we  made  some  of  that  code  redundant  with  better  schema 
definitions: 


•  https://github.com/whosonfirst/es-whosonfirst- 

schema/blob/master/ schema/2 .4/mappings  .spelunker.js 

on  https : //github.com/whosonf irst/es- 
whosonf irst- 

schema/blob/master/ schema/ 2 . 4 /mappings . spelunke 
r . json 


The  fact  that  all  the  indexing  code  is  in 

Python  https : / /github . com/whosonf irst ?language=python  is 
largely  an  historical  artifact  of  having  started  in  Python  (the  correct 
choice).  I  do  not  love  having  to  "prepare'1  documents  and  in  some 
ideal  world  we  could  just  throw  the  properties  dictionary  at  an  ES 
index  and  get  on  with  more  exciting  things. 


I  would  also  like  to  be  able  to  use 

Go  https : //github . com/whosonf irst ? language=go  to  index 
ES  because  it  is  simply  faster  and  we  could  distribute  pre-compiled 
binary  tools,  because  Python's  dependency  hoohah  is  reliably  sad- 
making.  The  combination  of  constructing  free-form  JSON 
documents  in  Go,  in  writing  all  the  scaffolding  code  to  deal  with  ES 
requests/responses  in  Go  and  just  limited  time  means  it  hasn't 
happened  yet. 


This  is  what  I  meant  when  I  said  we  use  Go  as  much  to  test 
decisions/as  sumptions  around  the  code,  with  a  strictly  typed 
language,  as  we  do  for  its  speed  and  muscle. 

I  see  this  inability  to  write  a  WOF  ->  ES  thingy  in  Go,  quickly 
and  with  only  a  manageable  amount  of  burden,  as  a  short¬ 
coming  and  something  to  be  addressed  going  forward.  We  have  a 
COUple  Other  places  https  :  / / github.com/whosonfirst/py- 
mapzen-whosonf  irst-export  where  we've  painted  ourselves  in 
to  a  language-specific  corner  (Python)  and  it's  useful  only  in  that 
it  illustrates  how  not  to  do  things  in  the  future. 

WOF  can't  support  all  the  languages  itself  but  it  definitely  shouldn't 
make  [  insert  language  here  ]  a  requirement  for  common  things... 
like  indexing  a  database.  Today  it  does. 

The  JSON  Schema 

Stuff  https : //github . com/whosonf irst/whosonf irst- j  son- 
schema  was  an  early  attempt  to  see  if  we  could  enforce  a  certain 
amount  of  consistency  and  quality  control  around  document  types 
without  making  the  mistakes  that  XHTML  2.0 

made  https : / /WWW. wired. com/2  0  09/ 07/the_w3c_buries_xhtml_ 
2dot0_html_5_is_the_f  uture_of_the_web/  around  Strict- ineSS 
(who  remembers  XHTML  2.0  right...?)  It  was  also  an  attempt  to  see 
if  it  could  be  used  to  automate  some  parts  of  Boundary 
Issues  http : / /localhost : 8 080 /blog /20 16/ 10/05 /boundary- 


issues-properties/  (the  editorial  tools)  which  are  written  in 
PHP/Flamework  https  : / /github.com/whosonf irst? 
language=php  and 

Javascript  https : / / github.com/whosonfirst? 

language= javascript  without  going  Full  Metal 

XForms  https://en.wikipedia.org/wiki/XForms  (no  offense 

to 

Micah  http://dubinko.info/blog/tags/standards/xforms/  ) 
about  everything. 

There's  a  whole  other 

discussion  http: / /localhost : 8080/blog/2017/05/10/ simple- 
is-hard/  about  how  Boundary  Issues  has  to  wrangle  and  round- 
trip  documents  and  forms  and  UI/UX  behaviour  based  on  property 
types  and  ACLs  but  that's  a  conversation  for  another  day. 


I  think,  as  I  write  this  now,  that  I  had  faint  hopes  of  being  able  to 
generate  the  ES  schemas  from  the  JSON  schemas,  or  at  least  use  the 
latter  as  a  starter  kit  for  the  former.  In  the  end  the  JSON  Schema 
stuff  never  warranted  the 

time  https : / /www. tbray . org/ ongoing /When/ 2  0  lx/ 2016/04/30/J 
soN-schema- funnies  to  prove  or  disprove  or,  more  specifically,  it 
was  going  to  take  too  long  prove  or  disprove  itself  either  way  and  we 
didn't  have  the  luxury  of  finding  out. 


I  am  totally  open  to  the  idea  that  we  might  be  able  to  revisit  JSON 
Schema  with  more  success  now  but  it  does  point  out  one  of  the  built- 
in  tensions  around  a  dataset  like  WOF:  Namely  that  outside  of  a 
handful  of  core/common/required 

properties  https: / /github.com/whosonf irst/whosonf irst- 
properties  which  should  enforce  strict-iness  in  any  dataset  as 
large  as  ours  there  will  be  type  inconsistencies.  It's  not  ideal  but  it  is 
also  entirely  realistic  to  expect  and  it's  not  clear  to  me  that  a  random 
qs  :  whatever  property  should  cause  a  fatal  error  that  prevents  the 
casual  user  from  getting  started. 

A  good  example  of  this  is  the  nightmare  around  LFS  and  large 
geometries  https  :  /  / github .  com/whosonf  irst- 
data/whosonf  irst-data#git-and-large-f  iles  (hello,  New 
Zealand)  which  is  why  we're  talking  about  making  the 
default/common  geometries  in  records  a  max  of  2-10  MB  and 
moving  all  the  "ground  truth"  geometries  in  to  dedicated  "you  will 
need  to  use  LFS"  repos. 

It's  also  the  motivating  factor  behind  the  "standard  places  response" 
(modeled  after  Flickr's  standard  photo 

response  https : / / code . f lickr . net/2  008 /08/19/standard- 
photos-response-apis-f or-civilized-age/  )  to  try  and  identify 
what  the  strictly  enforced  properties  in  a  record  are  and,  by 
extension,  what  we  and  consumers  should  "be  liberal"  about: 


•  https://github.com/whosonfirst/go-whosonfirst- 

SprMnterface  https  :  //github.  com/whosonf  irst/ go- 
whosonf irst-spr#interf ace 


See  the  way  the  definition  for  the  Id  (  )  method  returns  a  string 
instead  of  an  integer?  Yeah...  something  something  something  other 
data  sources  something  something  something... 


To  answer  your  immediate  question  of  "Is  this  stuff  all  documented 
somewhere?"  the  answer  is  yes,  here: 


•  https://github.com/whosonfirst/whosonfirst- 
properties/tree/master/properties  https :  / /github .  co 
m/whosonf irst/whosonf irst- 
properties/ tree /mas ter/ properties 


Some  of  those  records  are  incomplete  but  there  should  at  least  be  a 
placeholder  https : / / github.com/whosonfirst/whosonfirst- 
propert ies /blob/ master/ propert ies_template . j son  for  all  the 
properties.  Ultimately  I  would  like  for  all  those  machine  readable 
documents  to  be  used  to  generate  human-readable  documentation 
and  code-level  sanity  checking  (like  we  already  do  for  placetypes 
and  sources)  but  that  is  not  the  case  today: 

•  https://github.com/whosonfirst/whosonfirst- 

sources/tree/master/bin  https :  / / github .  com/ whoson 
f ir st /whosonfir st- sources /tree /mas ter /bin 


•  https://github.com/whosonfirst/py-mapzen- 
whosonfirst-sources/blob/ master/ utils/ mk- 
spec.py  https : / / github.com/whosonfirst/py- 
mapzen-whosonf irst- 

sources/blob/master/ util s/mk- spec . py 


•  https://github.com/whosonfirst/go-whosonfirst- 

sources/blob! master/ sources/ spec. go  https :  / /github 

. com/whosonf irst/ go-whosonf irst- 
sources/blob/master/ sources/spec . go 


•  https://github.com/whosonfirst/whosonfirst-www- 
api/blob/master/www/include/lib_whosonfirst _placetyp 
es_spec.php  https : //github . com/whosonf irst/whos 
onf irst-www- 

api/blob/master/www/ include /lib_whosonf irst_pla 
cetypes_spec . php 


On  top  of  all  this,  I  am  just  going  to  assume  that  all  the  query  syntax 
for  ES  6-8  has  changed  as  well?  Which  means  both  the  Spelunker 
and  the  API  will  need  to  updated. 

•  https://github.com/whosonfirst/whosonfirst-www- 
spelunker/blob/master/wwwlserver.py  https :  /  /githu 
b . com/whosonf irst/whosonf irst-www- 
spelunker/blob/master/www/ server . py 


•  https://github.com/whosonfirst/whosonfirst-www- 
api/blob/master/www/include/lib_whosonfirst _places.p 
hp  https : //github. com/whosonf irst/whosonf irst- 


www— 


api/blob/master/www/ include /lib_whosonf irst_pla 
ces . php 


In  case  you're  wondering  I  do  periodically  question  whether  or  not 
we  should  just  define  all  our  ES  queries  as  language-agnostic 
templates  but  after  a  few  moments  of  hating  myself  move  on  to  other 
things. 


So.  That's  maybe  more  than  you  were  hoping  for,  by  way  of  answers, 
but  welcome  to  my  world. 


It  seems  like  it's  time  to  spend  some  energy  on  ES  7.  Presumably 
there  is  no  point  in  stopping  at  6  if  8  is  already  being  discussed?  I 
would  start  by  making  a  new  branch  and  version- specific  folders  in: 

•  https://github.com/whosonfirst/es-whosonfirst- 

schema/ tree/ master/ schema  https :  //github .  com/wh 
osonf irst/ es-whosonf irst- 
schema/ tree/master/ schema 


And  then  updating  the  Python  libraries  accordingly.  There  is  nothing 
precious  in  them  so  most  things  are  fair  game,  short  of  embarking  on 
a  wholesale  Python  2  ->  3  rewrite  (which  is  in  the  cards,  but  not  right 
now...)  Removing  as  much  of  the  type-checking  hoohah  from  the 


Python  code  and  in  to  the  schema  would  be  a  good  thing  but  I  have  a 
feeling  expediency  will  dictate  keeping  some  of  the  former. 

The  subtext  here  is  that  I  too  would  like  to  have  a  dataset  where 
we  can  enforce  type-ish  consistency  across  the  board  and  where 
the  penalty  for  stopping  and  fixing  one  bad  record  isn't  the 
potential  of  having  to  repeat  the  process  ad  infinitum.  Our 
burden,  I  think,  will  be  to  live  in  the  interim  working  towards 
that  better  world... 

The  first  step  should  just  be  "indexing  the  data"  and  adjusting  the 
schemas  and  "prepare"  code  as  the  circumstances  demand.  After  that 
we  can  sort  out  /  update  all  the  query  nonsense.  If  we  need  to 
sacrifice  the  emoji 

Support  https : / / github.com/whosonfirst/es-whosonfirst- 
schema/ tree / master / synonyms#emo  j  i-synonyms  in  the  short¬ 
term  then  that's  probably  the  right  thing,  although  it  is  pretty  cool  to 
be  able  to  do  this: 

•  https:! 7 spelunker.whosonfirst.org/ search!  ?alt=  A 
https : // spelunker . whosonf irst . org/ search/ ? 
alt=& 


Good  times... 


One  immediate  side-effect  of  the  email  thread  with  Gary  is  that  there 
are  now  tools  to  crawl  all  the  Who's  On  First 
records  https : / / github . com/whosonf irst/go-whosonf irst- 
properties#toois  and  ensure  that  each  of  their  properties  has  a 
corresponding  record  in  the  whosonfirst- 

properties  https  :  /  / github .  com/whosonf  ir st /whosonf  irst- 
properties  repo  and  Gary  is  writing  code  to  test  those 
files  https : / / github . com/ vicchi/php-whosonf irst- 
properties 


Each  of  those  records  still  needs  to  have  descriptive  metadata  added 
and  some  of  the  records  will  be  bunk  and  in  need  of  being 
superseded  or  deprecated  but  at  least  its  progress. 


Tiny  steps  may  be  tiny  but  they  are  still  forward  momentum  so 
"Onwards!"  and  all  that  good  stuff... 
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Using  IMF  at  SFO  Museum 


Using  IMF  at  SFO  Museum 


Thumbnail  and  color  palette  of  a  photo  of  the  When  Art  Rocked  exhibition  posters  in  the  International  Terminal.  Photo  by  SFO  Museum. 


This  was  originally  published  on  the  SFO  Museum  Mills  Field 

weblog  https://millsfield.sfomuseum.org/blog/2018/07/18/iiif/  ,  in  July  2018. 


This  is  a  technical  blog  post  about  image  processing.  The  short  non-technical 
summary  is  that  not  only  were  we  able  to  use  open  source  software  to  simplify  our 
image  processing  workflow  ( and  reduce  costs)  but  we  contributed  our  improvements 
back  to  the  project  so  that  hopefully  others  in  the  museum  sector  may  benefit  from 
our  work.  Yay! 


The  past 

Some  quick  background  in  order  to  set  the  stage  for  the  rest  of  this  blog  post: 


•  Between  2012  and  2015 1  was  "Head  of  Internet  Typing "  and  part  of 
the  Digital  and  Emerging  Media 

team  https://iabs.cooperhewitt.org  at  the  Cooper  Hewitt 
Smithsonian  Design  Museum  https :  //cooperhewitt .  org  ,  in 
New  York  City.  One  of  my  responsibilities  was  wrangling  images  for 
all  their  different  uses  by  all  the  different  parts  of  the  museum:  The 
collection  website  itself,  the  API,  the  interactive  tables  in  the 
galleries. 


•  In  2012  the  first  verion  of  the  International  Image  Interoperability 
Framework  http :  //iiif .  io/  (DIF)  Image  API  was  introduced. 
IIIF  is  community  project  driven  by  public  institutions  and  private 
companies  in  the  cultural  heritage  sector  to  produce  common 
Standards  and  interfaces  http://iiif.io/technical-details/ 
(APIs)  for  accessing  and  working  with  collections  material.  These 
include  APIs  for  image  processing,  producing  presentations  and 
search. 


•  Sometime  in  the  summer  of  2016 1  became  aware  of  IIIF  and  that 
fall  I  wrote  an  implementation  of  the  IIIF  Image  API  in  the  Go 
programming 

language  https : //www. aaronland. info/weblog/2016/09/18/marsh 
mallows /#iiif  to  help  work  through  some  of  the  questions  I  had 
about  the  standard.  That  software  is  called  go- 
iiif  https://github.com/aaronland/go-iiif  . 


•  In  the  early  part  of  201 7 1  used  that  code  to  help  pre-render  zoomable 
image  tiles  for  all  the  objects  on  the  Cooper  Hewitt  collection 

Website  https://labs.cooperhewitt.org/2017/parting-gifts/  . 

That  involved  processing  289, 208  images  producing  58, 977,  784 
individual  tiles  (or  518  GB  of  data).  It  was  a  valuable  project  to  help 
understand  the 

limits  https : //www. aaronland. info/weblog/2017/03/05/record/# 
numbers  involved  in  using  IIIF  to  process  a  lot  of  images.  The 
whole  thing  took  about  45  days,  largely  because  there  were  readily 
apparent  bottlenecks  involving  CPU  and  disk  HO  that  1  wasn 't  able  to 
address  at  the  time. 


An  example  of  the  different  image  sizes,  and  color  palettes,  on  the  Cooper  Hewitt  collection  website.  ( Animated  GIF  by  Sha  Hwang.) 


During  the  years  that  I  was  at  the  Cooper  Hewitt,  for  any  given  object  on  the 
collection  website  https://collection.cooperhewitt.org/  there  would  be  a 
number  of  steps  to  produce  the  catalog  of  derivative  images  necessary  for  that 
object.  These  included: 


Generating  the  usual  set  of  small,  medium,  large  (and  everything  in 
between )  thumbnails  from  the  primary  image. 


•  Generating  a  square 

thumbnail  https : //labs .cooperhewitt.org/2013/b-is-for- 
beta/  that  was  cropped  around  a  focal  point  determined  by 

calculating  the  most  " interesting " 

part  https : //labs .cooperhewitt.org/2013/default-sort-or- 
what-would-shannon-do/  of  the  primary  image. 

•  Generating  a  black  and  white  halftone 

Version  http://mike.teczno.com/notes/atkinson.html  for  some 
of  the  smaller  th  umbnails  to  use  as  placeholders  while  the  large  full- 
color  images  were  loaded  in  the  browser. 

•  Extracting  and  indexing  the  dominant 

Colors  https://labs.cooperhewitt.org/2013/giv-do/  from  the 
primary  image. 

This  workflow  was  cobbled  together  using  a  combination  of 

GraphicsMagick  http://www.graphicsmagick.org/  and  a  series  of  purpose- 
fit  libraries  and  web  services  https://iabs.cooperhewitt.org/2014/http- 
ponies/  all  held  together  with  shell  scripts.  To  say  the  images  were  "cobbled 
together"  isn't  meant  to  speak  ill  of  the  process  (after  all,  I  wrote  most  of  it).  It 
simply  reflects  the  fact  that  almost  everything  done  between  2012  and  2015  was 
done  in  a  hurry  and  without  a  lot  of  polish,  itself  a  reflection  of  the  larger  work  to 
re-imagine  and  re-open  the 

museum  https : //mw2015 .museumsandtheweb.com/paper/strategies-against- 
architecture-interactive-media-and-transformative-technology-at- 

cooper-hewitt/  in  December  2014. 


The  present 


Life  and  Style  in  the  Age  of  Art  Deco 


See  all  81  photos  random 


■  □  ■  □  □ 


■  ■  ■  □  ■ 


□  □  ■  ■  □ 


"Smart"  cropped  thumbnails  and  color  palettes  of  installation  photos  from  the  Life  and  Style  in  the  Age  of  Art  Deco  exhibition.  Photos  by  SFO 

Museum. 


Fast  forward  to  2018  and  I  am  once  again  "Flead  of  Internet  Typing"  at  SFO 
Museum  https :  /  /f  lysfo . com/museum  .  I  actually  have  a  suitably-suitable  title 
befitting  my  role  as  a  civil  servant  but  it  rarely  seems  to  help  people  understand 
what  I  do  all  day.  Once  again  there  is  the  need  to  process  a  lot  of  images,  in  a  large 
collection,  for  a  variety  of  purposes. 


Knowing  that  I  could  always  fall  back  on  the 

tools  https :  //github .  com/cooperhewitt/  developed  at  the  Cooper  Flewitt  I 
decided  to  see  whether  we  could  integrate  all  of  the  Cooper  Flewitt  workflow  in  to 
the  go-iiif  code,  allowing  us  to  maintain  a  single  service  (iiif- 
server  https : //github .com/ aaronland/ go-iiif #iiif -server  )  rather  than 
multiple  services.  Could  we  do  just  as  much,  or  more  even,  with  less? 


The  short  version  is:  Yes  we  can! 


"Smart"  cropped  thumbnails  and  color  palettes  of  installation  photos  from  the  The  Nation’s  Game:  The  NFL  from  the  Pro  Football  Hall  of  Fame 

exhibition.  Photos  by  SFO  Museum. 


Some  of  the  implementation  specifics  remain  a  bit  rough  around  the  edges  but 
everything  works  and  can  be  improved  over  time.  Here's  an  annotated  version  of 
some  of  the  release  notes  https : //github . com/ aaronland/go-iiif /releases 
for  the  various  changes  we  made  to  go-iiif  https  :  //github. com/aaronland/go- 
iiif  in  the  process: 


Add  support  for  smart-cropping  via  (non-standard)  -1,-1,W,H 
regionByPx  instruction 


Under  the  hood  go-iiif  uses  the  bimg  https://github.com/h2non/bimg 
package  which  is  itself  a  wrapper  around  the 

libvips  https://jcupitt.github.io/iibvips/  image  processing  library. 
Version  8.5  https : // jcupitt . github . io/ libvips /20 17/03/ 16 /What ' s-new-in- 
8.5.  html  of  libvips  introduced  "...a  new  cropping  mode  called  "attention "  which 


searches  the  image  for  edges,  skin  tones  and  areas  of  saturated  colour,  and  attempts 
to  position  the  crop  box  over  the  most  significant  feature 


"Smart"  cropped  thumbnail  of  an  installation  photo  from  the  Life  and  Style  in  the  Age  of  Art  Deco  exhibition.  Photo  by  SFO  Museum. 


This  is  what  go-iiif  uses  in  place  of  the  Cooper  Hewitt's  "Shannon 
number  https : //labs . cooperhewitt . org/2013/def ault-sort-or-what- 
wouid-shannon-do/  "  for  generating  thumbnails  around  a  focal  point.  The  results, 
in  both  cases,  are  about  what  you'd  expect  from  a  computer  in  2018: 

Generally  the  results  are  good  enough  for  most  purposes  until  they  are  sometimes 
very  weird  (and  occasionally  very  very 

wrong  https  :  //www. youtube. com/watch?v=v9EKV2nSU8w  )  with  little  or  no 
reasoning  that  a  human  can 

discern  http : //berglondon . com/blog/2  01 1/08/03/the-robot-readable- 


world/ 


Add  preliminary  support  for  colour  extraction  as  a  profile  service 


Whereas  the  Cooper  Hewitt  uses  Giv  Parvaneh's 

RoyGBiv  https :  //github.com/givp/RoyGBiv  library  for  exacting  colors  from 

an  image  go-iiif  currently  uses  Rob  Cherry's 

vibrant  https://github.com/RobCherry/vibrant  package. 


As  of  this  writing  go-iiif  returns  specific  "swatches"  calculated  by  vibrant 
rather  than  a  ranked  set  of  colors.  They  are:  VibrantSwatch , 
LightVibrantSwatch,  DarkVibrantSwatch,  MutedSwatch, 
LightMutedSwatch ,  DarkMutedSwatch.  This  has  the  side-effect  of 
ensuring  a  broader  overall  range  of  colors  for  a  given  image  but  can  sometimes  still 
miss  obvious  matches  (to  human  eyes). 


The  principal  reason  for  this  decision  was  expediency.  As  much  as  I  would  have 
enjoyed  spending  work-hours  porting  Giv  http://www.givp.org/  's  code, 
written  in  a  different  programming  language,  it  doesn't  seem  like  the  best  use  of  my 
time  (yet)  given  everything  else  we're  trying  to  do. 


palette  for  photo  size  n 


iiif-server  palette 

■  #C4181F 

■  #D8B3A9 

■  #7B0B15 

■  #A8617D 
#C8C8B8 

ch-server  palette 

■  #061747 

■  #191970  (closest  css4  colour) 
#bebab4 

#c0c0c0  (closest  css4  colour) 
#d7d3cb 

#d3d3d3  (closest  css4  colour) 

■  #9c1117 

■  #b22222  (closest  css4  colour) 

■  #040611 

|  #000000  (closest  css4  colour) 


palette  for  photo  size  sq 


■  # A54060 
SA2D8DE 

■ #071E62 

■  #6C9799 
#A8D2C9 

ch-server  palette 

#93d2e1 

#87ceeb  (closest  css4  colour) 
#b1deda 

#b0e0e6  (closest  css4  colour) 

■  #769284 

|  #808080  (closest  css4  colour) 
■ #72a8b6 

|  #5f9ea0  (closest  css4  colour) 
■ #93ad9d 

|  #8fbc8f  (closest  css4  colour) 


Comparing  the  color  extraction  algorithms  in  the  go-iiif  and  RoyGBlv code  bases.  Meanwhile,  the  "smart"  cropping  algorithm  really  likes 
airplane  tailfins  http://www.elasticspace.com/20i2/02/robot-readable-world  .  It  almost  always  chooses  them  as  the  most  interesting  part 

of  a  plane. 


The  second  reason  is  that  the  go-iiif  code  is  designed  to  support  multiple  so- 
called  extruders  https : //github.com/aaronland/go-colours#extruders 
Extruders  are  the  bits  of  code  that  decide  which  colors  are  represented  in  an  image. 
Like  "smart"  or  "interestingness"  based  image -cropping  these  extruders  are  the 


biases  in  the  algorithms  https  :  //points  .  datasociety .  net/toward- 
accountability-6096e38878f  0?gi=864  9ac5c48f  6  that  increasingly  haunt  OUr 
daily  lives  https :  //ainowinstitute.org/  so  there  is  no  expecation  that  any 
one  extruder  will  suit  every  need. 

The  vibrant  code  returns  results  that  are  either  satisfactory  or  good  enough  to 
prove  the  point  that  there  is  a  working  model  and 

framework  http://iiif .io/api/image/2 . l/#profile-description  for 
extracting  colors  inside  of  go-iiif .  As  time  and  circumstance  permit  we  will 
cycle  back  to  improve  existing  extruders  for  extracting  color  palettes  and  add  new 
ones  along  the  way. 


Add  a  Dockerfile 


We  don't  process  images  in  real-time  so  our  need  for  an  image-processing  service  is 
going  to  occur  in  bursts.  Support  for 

Docker  https : // github . com/aaronland/ go-iiif#docker  allows  US  to  run  the 
go-iiif  server  as  an  on-demand  and  scalable  Amazon  Web  Services  Elastic 
Container  Service  https :  //aws .  amazon .  com/ecs/  rather  than  as  a  dedicated 
server  that  we  need  to  operate  (and  pay  for)  24  hours  a  day. 


Add  preliminary  support  for  (non-standard)  -1  rotation  to  explicitly 
disable  automatic  EXIF  Orientation  rotation 


I  don't  know  if  there  has  ever  been  a  discussion  in  the  IIIF  working 

groups  https : // groups . google . com/ forum/#! forum/ iiif-discuss  about  this 


but  if  you  are  implementing  the  IIIF  Image  API  specification  using  software  that 
automatically  rotates  JPEG  images  based  on  the  value  of  that  image's  EXIF 
Orientation 

flag  https : //www.daveperrett . com/articles /2 012/ 07/2 8 /exif-orientation- 
handiing-is-a-ghetto/  (which  most  JPEG  libraries  do  these  days)  but  which 
doesn't  update  that  image's  EXIF  Orientation  flag  accordingly  (which  most 
JPEG  libraries  don't  these  days)  then  you  end  up  in  a  fun-house  mirror- world  of 
wrong  if  you  transform  the  result  of  an  IIIF  transformation  a  second  time. 


go-iiif  has  a  flag  to  stop  this  from  happening  now. 


Boarding  Area  D 

See  all  237  photos  random 


"Smart"  cropped  installation  photos  from  the  The  Typewriter:  An  Innovation  in  Writing ,  the  Maneki  Neko:  Japan’s  Beckoning  Cat  and  the 
Classic  Monsters:  The  Kirk  Hammett  Collection  exhibitions.  Photos  by  SFO  Museum. 


So  how  does  this  all  work  in  practice?  Pretty  much  as  follows,  keeping  in  mind  that 
the  samples  below  should  be  treated  as  pseudo-code. 


First,  let's  assume  that  we're  running  a  copy  of  the  go-iiif  iiif- 

server  https://github.eom/aaronland/go-iiif#iiif-server  on  the  same 

local  machine  we're  working  on,  and  that  it's  listening  for  requests  on  port  8080. 


$>  bin/iiif-server  -config  config.json 

2018/03/07  15:45:07  Serving  127.0.0.1:8080  with  pid  12075 


We  maintain  a  dictionary  of  IIIF  instructions  where  each  key  is  a  named  label  and 
its  value  is  another  dictionary  of  IIIF  Image  API  instructions,  sometimes  called 
request  parameters  http : // iiif . io/api/ image/2 . l/#image-request- 
parameters  ,  specific  to  that  label.  For  example: 


$GLOBALS[ "cfg" ] [ "iiif_default_instructions" ]  =  array( 

#  -1  means  "do  not  autorotate"  (which  is  go-iiif  specific) 

"o"  =>  array ("size"  =>  "full",  "format"  =>  "rotation"  =>  "-1"), 

"b"  =>  array("size"  =>  " 11024,768",  "format"  =>  "jpg"), 

"c"  =>  array("size"  =>  "1800,600",  "format"  =>  "jpg"), 

"dd"  =>  array ("size"  =>  "1800,600",  "quality"  =>  "dither",  "format"  =>  "jpg"), 

"z"  =>  array("size"  =>  "1640,480",  "format"  =>  "jpg"), 

#  -1,-1  means  "smart  crop"  (which  is  go-iiif  specific) 

"sq"  =>  array("size"  =>  "full",  "region"  =>  "-1,-1,320,320",  "format"  =>  "jpg"), 

"d"  =>  array("size"  =>  "full",  "quality"  =>  "dither",  "region"  =>  "-1,-1,320,320",  "format" 
"n"  =>  array("size"  =>  "1320,240",  "format"  =>  "jpg"), 

); 


Photo  of  the  When  Art  Rocked  exhibition  posters  in  the  International  Terminal. 


Let's  say  we  want  to  take  the  photo  of  the  When  Art 

Rocked  https : //boingboing.net/2014/10/14/when-art-rocked.html 


exhibition  posters,  from  the  top  of  this  blog  post,  and  generate  a  square  thumbnail 
that's  been  halftoned  and  "smart"  cropped. 


We  store  these  instructions  using  the  d  key  which  the  states  that  the  IIIF  quality  for 
the  new  image  should  be  dither  (which  means  "halftone")  and  the  region  should 
be -1,-1,320,320  (which  means  "a  square  image  320  pixels  to  a  side,  where  the 
center  point  is  chosen  using  smart  cropping"). 


$img  =  "when-art-rocked.jpg"; 

$sz  =  "d"; 

$args  =  $GLOBALS[ "cfg" ] [ "iiif_default_instructions" ] [ $sz ] ; 

If  we  combine  the  instructions  for  the  d  label  with  a  set  of  default  rules  then  we  can 
easily  build  the  URL,  as  defined  by  canonical  URI 

Syntax  http : //iiif . io/api/ image/2 . l/#canonical-uri-syntax  , to  request 
the  new  image  from  the  iiif-server  instance,  like  this: 


$defaults  =  array ( 

"region"  =>  "full",  #  deprecated  in  3.x 

"size"  =>  "full",  #  deprecated  in  3.x 

"rotation"  =>  "0", 

"quality"  =>  "default", 

"format"  =>  "jpg". 


$args  =  array_merge($defaults,  $args); 

$url  =  "http: //localhost : 8080/{$img} /{$args [ "region" ] }/{$args [ "size" ] }/ {$args [ "rotation" ] }/{$args [ "q 
$img_d  =  http_get ( $url ) ; 


And  here's  the  output,  along  with  a  full-color  square  cropped  thumbnail: 


"Smart"  cropped  and  halftoned  thumbnail  photos  of  the  When  Art  Rocked  https://boingboing.net/20i4/i0/i4/when-art-rocked.html 
exhibition  posters  in  the  International  Terminal.  Photos  by  SFO  Museum. 


To  extract  the  colors  for  this  image  we  call  the  IIIF  image 

information  http :  //iiif .  io/api/ image/2  .  l/#image-information  endpoint, 
which  contains  an  additional  profile 


Service  http : // iiif . io/api/ image/2 . l/#related-services  for  include  color 
palette  data: 


$url  =  "http: //localhost : 8080/{$img}/info. json 

$info  =  http_get ( $url ) ; 

$colors  =  $info[ "service" ][ "palette" ] ; 

And  the  output  will  look  something  like 

this  https://github.eom/aaronland/go-iiif#palette-l  : 

[ 

{ 

"hex":  "#001689", 

"name":  "#001689", 

"closest" :  [ 

{ 

"hex":  "#e3256b" , 

"name":  "Razzmatazz", 

"reference" :  "crayola" 

}, 

{ 

"hex":  "#c71585 " , 

"name":  "mediumvioletred" , 

"reference":  "css4" 

} 

i, 

"reference":  "vibrant" 

}, 

. . .  and  so  on 

] 

There  are  many  different  ways  to  write  code  to  do  the  same  thing  and  the  point  of 
the  examples  above  is  not  to  suggest  that  you  should  do  it  our  way,  only  that 
whichever  way  you  do  it  should  be  that  easy. 


That  is  the  real  benefit  of  using  go -iiif  for  us.  If  we  need  to  add  a  new  image 
size  all  we  need  to  do  is  add  the  instructions  to  the  dictionary  above  and  write  a  little 
bit  of  custom  code  to  loop  over  all  the  images  in  the  collection  processing  just  that 
one  size  and  we're  done. 


Available  photo  sizes 

Label 

Width 

Height 

URL 

O 

3600 

2399 

b 

1024 

682 

https://.. ./static/115/9 34/301/1 /1159343011_A7WYQJj2Ap6ZPvXJXUY61QgdTVR47z0ZU9FmN2DFfmN7xHLYN  ksp_b.jpg 

c 

800 

533 

https://.. ./static/115/934/301/1/1159343011_A7WYQJj2Ap6ZPvXJXUY61QgdTVR47z0ZU9FmN2DFfmN7xHLYNksp_c.jpg 

dd 

800 

533 

https://.../static/115/9 34/301/1 /1159343011_A7WYQJj2Ap6ZPvXJX  UY61QgdTVR47z0ZU9FmN2DFfmN7xHLYNksp_dd.jpg 

z 

640 

426 

https://.. ./static/115/934/301 /I /1159343011_A7WYQJj2Ap6ZPvXJXUY61  QgdTVR47zOZU9FmN2DFfmN7xHLYNksp_z.jpg 

d 

320 

320 

https://.. ./static/115/934/301/1/1159343011_A7WYQJj2Ap6ZPvXJXUY61  QgdTVR47zOZU9FmN2DFfmN7xHLYNksp_d.jpg 

sq 

320 

320 

https://.../static/115/9 34/301/1 /1159343011_A7WYQJj2Ap6ZPvXJX  UY61QgdTVR47z0ZU9FmN2DFfmN7xHLYNksp_sq.jpg 

n 

320 

213 

https://... /static/115/9 34/301/1 /1159343011_A7WYQJj2Ap6ZPvXJXUY61QgdTVR47z0ZU9FmN2DFfmN7xHLYNksp_n.jpg 

The  same  is  true  if  we  need  to  reprocess  some  or  all  of  the  images  because...  well, 
because  everyone  has  different  reasons  for  needing  to  reprocess  all  their  images. 
These  things  happen  so  it  shouldn't  be  a  struggle  to  accomodate  them  when  they  do. 


TTTF  is  not  for  every  image  processing  task  but  given  that  most  image  processing 
tasks  are  some  combination  of  basic  region,  size,  rotation,  quality  and  final  output 
format  criteria  the  value  of  IIIF  is  in  dissolving  (hiding)  the  complexities  of  all  that 
work  behind  a  simple  URL  and  an  HTTP  request. 


The  future 


A  half  toned  installation  photo  from  the  A  World  of  Characters:  Advertising  Icons  from  the  Warren  Dotz  Collection  exhibition.  Photo  by  SFO 

Museum. 


So,  that's  the  state  of  go-iiif  today.  All  of  our  fixes  and  contributions  have  been 
merged  back  in  to  the  original  project,  which  is  available  on  GitHub: 

https://github.com/aaronland/go-iiif 

In  the  short-term  I'd  like  to  address  some  of  the  issues  around  color  extraction 
mentioned  above.  In  the  near-term,  I  would  like  to  add  support  for  a  pure  Go  image 
processing  "engine".  Somewhere  in  between  I'd  like  to  make  sure  that  all  the  images 
in  the  SFO  Museum  collection  are  tiled  and  zoomable. 


libvips,  the  code  that  does  the  actual  pixel  crunching  in  go-iiif,  is  a 
remarkable  piece  of  software  but  it  is  a  C  library  that  needs  to  be  compiled  and 
comes  with  a  non-trivial  set  of  dependencies  all  of  which  means  building  go-iiif 
requires  a  level  of  technical  expertise  that  is  outside  the  reach  of  many  people. 


A  pure  Go  image  processing 

engine  https :  //github .  com/ anthonynsimon/bild  would  allow  us  to  build  pre¬ 
compiled  binary  versions  of  the  go- ii if  tools  for  specific  platforms.  That  means 
you  or  your  institution  could  download  a  copy  of  the  software  and  start  using  it  right 
away  without  having  to  contemplate  phrases  like  "Just  install  {WORDS  THAT 
SOUND  LIKE  GIBBERISH  TO  YOU}..." 

There  is  also  the  possibility  of  using  a  pure  Go  version  of  the  iiif-tile- 
seed  https://github.eom/aaronland/go-iiif#iiif-tile-seed  program  as  an 
Amazon  Web  Services  Lambda  function  https :  //github . com/aws/aws- 
lambda-go  for  quickly  and  cheaply  pre-rendering  tiled  and  zoomable  images. 


The  Enduring  Designs  of  Josef  Frank 


See  all  50  photos  |  random 


■■■□■  ■□□□□ 


Thumbnails  and  color  palettes  of  installation  photos  from  the  The  Enduring  Designs  of  Josef  Frank  exhibition.  Photos  by  SFO  Museum. 


An  ideal  scenario  is  one  where  a  museum  could  upload  a  set  of  full-sized  images  to 
a  AWS  S3  bucket,  wait  for  Amazon's  computers  to  process  each  image  with  the 
iiif-tile-seed  function  and  then  find  a  new  set  of  tiled  images  to  download 
(along  with  a  reasonable  bill  for  services  rendered)  in  a  different  S3  bucket. 


That's  still  not  possible  today  but  it  should  be.  One  day  it  will  be. 


2018-07-18 
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Maps  (and  map  tiles)  at  SFO  Museum 


Maps  (and  map  tiles)  at  SFO 
Museum 


This  was  originally  published  on  the  SFO  Museum  Mills  Field 

weblog  https: //millsfield. sfomuseum.org/blog/2018/07/31/maps/  ,  in 

July  2018. 


This  is  a  long  blog  post  about  maps  that  gets  increasingly  technical 
as  it  goes  on.  The  short  non-technical  summary  is  that  maps,  and 
especially  maps  on  the  internet,  are  still  a  complex  beast  and  we  are 
beginning  to  stand  (wobble?)  on  the  shoulders  of  existing  open 
source  and  open  data  projects  to  develop  a  long-term  and 
sustainable  maps  infrastructure  for  the  museum. 


We  are  starting  to  think  about  maps.  Some  of  you  may  have  noticed 
a  lot  of  maps  in  the  pictures  from  our  last  blog 
post  https : / /www. aaronland. info/weblog/2018/ 07/ 18/thing 
s/#iiif  .  Those  maps  are  what  this  blog  post  is  about. 


Location  and  place,  as  you  might  imagine,  are  core  to  an  airport. 
They  are  essential  to  any  museum  and  museum  collection,  really,  but 
for  the  sake  of  this  blog  post  we'll  just  say  that  location  and  place  are 
core  to  our  museum. 

We  need  and  want  maps  to  contextualize  our  collection  whether  it's 
where  an  object  was  created,  where  the  institution  that  lent  us  an 
object  is  located,  where  in  the  airport  that  object  is  on  display  or 
where  on  the  larger  campus  a  photo  from  the  archive  depicts.  It 
might  also  be  where  an  airplane  is  manufactured,  where  it's  going  or 
where  it's  coming  from.  The  list  goes  on. 


San  Francisco  International  Airport  (SFO),  c.  1978  and  2017 


We  also  want  a  variety  of  different  maps.  We  want  maps  for 
wayfinding  but,  equally,  we  want  stylized  maps  whose  function  is  to 
be  the  soundtrack  to  a  larger  story.  We  want  historic  maps  to  help 
people  understand  the  evolution  of  the  airport  and  the  museum  itself. 

Maps  it  turns  out  are  also  a  useful  lens  to  help  think  about 
sustainable  infrastructure  in  general,  and  this  is  especially  true  for  a 
museum.  In  order  to  explain  why  we  need  to  take  a  brief  detour  and 
review  what  I  call: 


A  short  history  of  maps  and  the  internet 


is  perfect” 


What  follows  is  an  aggressively  simplified  history  of  maps  and  the 
internet.  People  in  the  "business"  of  maps  will  no  doubt  quibble  with 
the  nuance  and  probably  some  of  the  details  but,  in  broad  strokes, 
there  have  been  four  major  periods  so  far  in  this  history. 


Before  Google  Maps  (aka  "MapQuest") 


Saying  "before  Google  Maps"  is  sort  of  like  saying  "before  the 
discovery  of  fire"  for  all  that  most  people  can  relate  to  the  idea  that 
such  a  time  ever  existed.  Try  to  imagine  a  world  where  not  only  were 
maps  on  the  internet  static  but  were  really  really  small  because 
anything  bigger  was  prohibitive  in  bandwidth  speed  and  costs. 

Google  Maps  (2005) 


'Google  Maps  showed  how  it  was  possible  to  fake  the  appearance  of  continuous  flow  by  assembling  ...  images  on  the 
client"  from  Michal  Migur ski's  Tiles  Then,  Flows  Now  lecture  (2008). 


In  2005  Google  launched  their  Google 

Maps  https :  /  /maps .  google .  com  product  and  nothing  has  been 
the  same  since. 


Google  decided  that,  for  the  purposes  of  their  map,  the  planet  was 
really  a  sphere  (rather  than  egg-shaped)  allowing  it,  equally,  to  be 
represented  as  a  square.  Google  sacrificed  the  North  and  South  poles 

and  the  good  graces  of  cartographers  and 

geographers  https : //medium. com/ @awoodruf f /your-maps-are- 
not-iying-to-you-9c8c3ic599 if  ever  since  because  performing 
math  on  squares  was  easy,  even  in  the  underpowered  (by  today's 
standards)  web  browsers  of  the  day. 


Google  reduced  the  map  (the  entire  world)  to  squares  identified  by 
"zoom"  levels.  Zoom  level  "1"  contains  a  single  square  depicting  the 
entire  planet.  Zoom  level  2  contains  4  squares,  zoom  level  3  contains 
16  squares  and  so  on  with  every  zoom  level  containing  more  detail 
and  specificity. 

When  they  launched  there  were  16  zoom  levels;  16  maps  (pictures) 
of  the  same  thing  but  with  different  levels  of  resolution,  all  cut  up  in 
to  little  squares  of  the  same  size. 

Maps  on  the  internet  no  longer  meant  producing  and  downloading  a 
single  large  and  detailed  map  but  rather  downloading  lots  of  tiny 


pieces  of  a  map  http: / /mike. teczno.com/notes/de2008 .html 
and  reassembling  them  in  the  browser. 


OpenStreetMap  (2004) 


✓  OpenStreetMap  View  Edit  ▼  History 


Ixbarth  home  inbox  logout 
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The  browser-based  iD  OpenStreetMap  editor  (2013). 


I've  included 

OpenStreetMap  https://www.openstreetmap.org/  (OSM) 
after  Google  Maps  because  while  it  was  launched  a  year  before 
Google  Maps  it  wasn't  until  2007-2008  that  it  really  began  to  come 


in  to  its  own.  OSM  is  an  openly  licensed,  free  to  use,  map  of  the 
entire  world  created  and  managed  by  volunteer  users.  Both  the 
finished  map  and  the  raw  data  used  to  produce  that  map  are 
available  for  anyone  to  make  maps  of  their  own. 


OSM  is  not  without  its  controversies  but  it  has,  in  a  short  14  years, 
accomplished  the  impossible  demonstrating  that  a  community-run 
project  can  produce  a  finished  product,  and  the  infrastructure  needed 
to  do  so,  whose  quality  rivals  and  often 

exceeds  https : / /wiki .openstreetmap. org/wiki/ Completenes 

s  that  of  the  large  commercial  vendors. 


Google  didn't  stop  thinking  about  maps  after  2005  and  today,  in 
2018,  they  continue  to  outpace  their 

Competitors  https  :  / /WWW.  justinobeirne .  com/ google-maps- 
moat/  .  They  have  earned  their  success  but  they  also  have  resources 
and  operate  at  a  scale  that  is  so  far  removed  from  most  people's 
means  that  having  an  alternative  to  Google,  like  OSM,  is  critically 
important  https : / /WWW. inderapotheke.de/blog/farewell- 
google-maps  . 


Vector  Tiles  (c.  2012) 


{ "buildings" : { "type" : "FeatureCollection 
{ "geometry" : { "type" : "Polygon" , "coordina* 
[-122.4024169,37.7949617 J , [-122.4030374 
{ "name" s "Transamerica  Pyramid" , "area” : 4: 
roof_shape" : "pyramidal" , "height" s 260.001 
"roof_height" : "260  m” , "id" : 24222973} } , 

{ "geometry" : { "type" : "Polygon" , "coordina' 
[-122.4047032,37.7952034] , [-122.4045228 
37.7952246], [-122.4045246,37.7952342], [ 
-122.4045772,37.7952280] , [-122.4045950,: 
.7952386] , [-122.4046012,37.7952716] , [-1: 
[-122.4039039,37.7953649], (-122.4038888 
[-122.4045221,37.7949440] ,[-122.4045390 
{ "kind" s "f irestation" , "name" : "San  Fran< 
Fire  Station  13" , "area" : 1247 , "addr_stre< 
: "Sansome  Street" , "addr_housenumber" : "5! 
[-122.4033561,37.7961638 ] ] ] } , "type" : "Fei 
[-122.4039860, 37.7963012] ] ] } , "type" : "Fe< 
"properties" :{"id":35537150, "area" : 1140 
{ "type" : "Polygon" , "coordinates" : [ [ [ -122 


Vector  tile/ data  illustration,  from  the  Nextzen  website  (2018). 


I  think  the  term  "vector  tiles"  is  a  misnomer.  When  most  people  talk 
about  vector  tiles  what  they  are  really  saying  is  tiles  that  contain 

data  http: / /mike. teczno.com/notes/postgreslessness- 
mapnik-vec tiles .  html  describing  a  place  rather  than  tiles  that 
contain  a  grid  of  differently  colored  pixels  depicting  that  place,  what 
have  traditionally  been  called  "raster"  tiles. 


There  is  actually  a  rich  vein  of  work  encoding  and  visualizing 

complex  data  sets  in  the  red,  green  and  blue  values  of  plain-old 


raster  tiles  http://city.stamen.com/digital_city/01/  but 
we'll  save  that  discussion  for  another  time. 

A  rule  of  thumb  when  producing  map  tiles  at  a  global  scale  is  that 
everything  is  manageable  up  to  zoom  level  10,  things  start  to  get 
hard  at  zoom  level  13  and  zoom  level  16  is  where  you  start  to  think 
seriously  about  giving  up.  This  is  relevant  because  there  are,  and 
should  be,  many  legitimate  ways  to  depict  and  represent  the  same 
(map)  data.  In  2018  the  norm  is  for  most  maps  to  have  20,  or  more, 
zoom  levels  so  the  cost  of  producing  multiple  representations  of  the 
same  data  can  be  non-trivial. 

The  reason  that  vector  (or  data)  tiles  are  important  is  that  all  of  the 
work  of  rendering  a  map,  converting  the  data  in  to  a  representation, 
is  handled  by  the  consumer  of  the  map  rather  than  the  producer. 
Google,  and  others,  now  create  a  single  tile  set  containing  metadata 
about  the  stuff  in  that  tile's  geographic  extent. 


Generally  anyone  who  makes  a  vector  tile  set  also  has  some  kind  of 
styling  language  and  a  rendering  engine  for  turning  all  this  data  in  to 
pictures.  That  allows  everyone  to  take  advantage  of  all  the  advances 
in  computer  graphics  and  rendering  capabilities  in  modern  web 
browsers  and  devices  at  a  fraction  of  the  time  and  cost. 


And  there's  the  rub 


This  is  what  open  data  and  vector  tiles  and  WebGL  and  custom  styling  languages  make  possible.  Illustration  from  Tron 
2.0  — Creating  a  Visual  Language  of  Scale,  by  Geraldine  Sarmiento  and  Patricio  Gonzalez  Vivo  (2016). 


Vector  tiles  have  been  a  net- win  for  maps  on  the  internet  and 
cartography  in  general.  The  kinds  of  maps  they've  made  possible  is 
incredible  and  astonishing  http :  /  / geraldinesarmiento .  com/ 


The  downside,  however,  is  that  in  order  to  use  those  maps  you  need  a 
whole  lot  of  advanced  software  (and  often  hardware)  that  depends  on 
a  lot  of  other  very  advanced  software  (and  more  hardware  still, 
sometimes).  Often  two  software  packages  won't  keep  track  of  each 
other's  changes  and  occasionally  one  piece  of  software  simply 
chooses  not  to 

Work  https : //arstechnica . com/ gadgets/2018/ 06/the-end-of- 
opengl- support-other-updates -apple-didnt-share-at-the- 
keynote/  with  another  piece  of  software. 


There's  nothing  map-specific  about  this  phenomenon.  It  has  existed 
since  the  beginning  of  computing.  It's  just  a  problem  that  feels 
especially  acute  when  we're  talking  about  maps. 


For  example,  most  vector  tile  rendering  engines  use  something 
called  WebGL  which  requires  a  graphics  card  and  some  sort  of 
display  in  order  to  work.  The  results  can  be 
amazing  https://thebookofshaders.com/  .  The  downside  is 
that  rendering  static,  or  raster,  version  of  a  complex  vector  tile  on  a 
computer  without  a  display  is  so  difficult  as  to  seem  impossible. 


Raster  tiles  might  not  seem  like  a  concern  in  a  world  where  vector 
tiles  exist  until  you  remember  that  raster  tiles  will  be  included  when 
you  print  a  webpage  and  vector  tiles...  will  not.  Print  is  just  one 
example  but  there  are  plenty  of  situations  when  a  static  map  that 
doesn't  require  a  small  mountain  of  real-time  computing  to  realize  is 
needed. 

The  good  news  is  that  by  making  a  "tile  containing  data"  the  core 
unit  of  currency  for  maps  it  is  still  possible  to  produce  future 
representations  when  the  specifics  of  a  given  rendering  technology 
no  longer  fit  the  circumstances  of  the  world. 

The  bad  news  is  that  rendering  data  graphically  is  still  hard.  It  is 
probably  still  best  summed  up  by  the  expression  "death  by  a 
thousand  cuts".  People  who  use  maps  are  often  ill-equipped  and 


rarely  have  the  time  to  deal  with  those  details,  especially  so  when 
some  part  of  a  complex  processing  chain  breaks. 

So  that's  what  this  blog  post  is  about.  But  first,  we  need  to  take 
another  quick  detour  and  discuss: 

A  short  history  of  the  "(map)zengularity" 


Illustration  of  available  map  styles,  from  the  Nextzen  website  (2018). 


Between  2015  and  2017  I  worked  for  a  mapping  services  company 
called  Mapzen  https://en.wikipedia.org/wiki/Mapzen 


Mapzen  was  founded  in  2013  and  its  goal  was  to  build,  and  offer  as 
services,  all  the  tooling  that  it  takes  to  deploy  a  map  on  the  internet. 
This  is  all  the  stuff  that  Google  has  successfully  (artfully,  even) 
managed  to  hide  from  people  but  is  still  there  under  the  surface. 

Not  only  was  Mapzen  going  to  tackle  building  its  own  routing 
engine  https://github.com/vaihaiia  , search 
engine  https :  //peiias .  io/  ,  graphics  rendering 
engine  https :  / /github .  com/ tangrams  along  with  user-facing 
styling  tools  https://tangram.city/play/  for  both  browser- 
based  and  native  https  :  / / github .  com/ tangrams / tangram-es 
applications,  mobility  and  transit  https :  //transit .  land/ 
products,  a  gazetteer  https://www.whosonfirst.org  with 
global  coverage  and  developer  kits  for  using  everything  on 
Android  https://github.com/nextzen/android  and 
iOS  https://github.com/nextzen/ios  it  was  also  going  to 
design  its  own  cartography  https ://www. next zen. or g/#all- 
together-now. . .  and  produce,  manage  and  deliver  its  own  map 
tile  infrastructure  https://github.com/tilezen 

And  it  was  going  to  do  all  of  this  using  only  open  source  software 
and  open  data 

sources  https : / /web . archive . org/web/20 1712 14  021029/https : 
/ /mapzen. com/blog/our-magna-carto/  ,  in  particular 
OpenStreetMap.  It  was  an  ambitious  project  that  had  a  good  run  but 


eventually  shut  its  doors  in 

2018  https  : / /web . archive . org/web/20180102230013/https : / / 
mapzen.com/biog/ shutdown/  .  One  of  the  things  that  Mapzen  was 
able  to  do,  in  late  2017,  was  produce  a  final  "build"  of  its  vector  tiles 
dataset  for  the  entire  planet. 

These  tiles  and  the  cartographic  styles  that  were  developed  during 
the  Mapzen  years  now  live  under  the  umbrella  of  the 
Nextzen  https://www.nextzen.org/  project  and  are  freely 
available  for  anyone  to  use. 

That's  a  pretty  big  deal. 

So  that’s  what  this  blog  post  is  about  (for  real,  this 
time) 
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Location  and  place,  as  you  might  imagine,  are  core  to  an  airport.  They  are  essential  to  any  museum  and  museum 
collection,  really,  but  for  the  sake  of  this  blog  post  we’ll  just  say  that  location  and  place  are  core  to  SFO  Museum. 


Normally,  you'd  use  Nextzen  tiles  with 

Tangram  https://github.com/tangrams/tangram  a  Javascript 
library  for  rendering  the  Nextzen  data  as  beautiful 
maps  https://sensescape.github.io/morphology/  in  the 
browser.  That's  what  we've  been  doing  for  some  of  our  earliest  work 
in  thinking  about  how  to  make  the  airport,  the  museum,  the 
collection  and  the  internet  all  hold  hands. 

Tangram  is  an  amazing  piece  of  work  but  it  has  two  properties  that 
we  didn't  want  to  be  limiting  factors  for  the  use  of  maps  in  our  work 


going  forward: 


First,  WebGL  is  not  well- supported  in  older  browsers  and  older 
hardware  (even  on  modern  hardware  it's  not  hard  for  WebGL  to 
make  your  browser  work  so  hard  that  your  computer's  fan  turns  on). 
Second,  as  I  described  earlier,  WebGL  elements  on  a  webpage  are 
not  printable. 

We  want  to  make  sure  that  the  work  we  do  going  forward  is 
available  and  accessible  to  as  many  people  as  possible,  many  of 
whom  won't  have  the  luxury  of  the  latest  and  greatest  technologies. 
Or  even,  in  some  cases,  uninterrupted  access  to  the  internet  so 
seemingly  "little"  things  like  printing  a  web  page  become  all  the 
more  important  than  they  might  seem  at  first  glance. 


A  .v  I  write  this  I've  just  discovered  that  printing  a  'WebGL'  map  works  in  the  Chrome  browser  but  it  remains  the  outlier 
so  this  is  usually  what  you'll  see  when  you  try  to  print  a  map  rendered  using  WebGL. 


I  mention  all  of  this  because  the  data  that  Nextzen  returns  in  their 
vector  tiles  is  encoded  using  the 

GeoJSON  http://geojson.org/  format.  This  is  important 
because  it's  possible  to  convert 

GeoJSON  https : / / github . com/whosonf irst/ go-whosonf irst- 

svg  in  to  something  called  a  scalable  vector 

graphic  https : / / developer .mozil la . org/en- 

us /docs /web/svG  or  SYG.  SYG  is  a  format  for  describing  vector 

graphics  in  a  web  browser  which  means  we  can  convert  the  textual 

description  of  a  map  tile  in  to  a  graphical  representation  of  that  data. 


Almost.  SYG  files  are  actually  just  text  files  but  there  are  tools, 
besides  your  web  browser,  for  rasterizing 

them  https : / / github . com/whosonf irst/ go-whosonf irst- 
image  which  is  fancy-talk  for  turning  them  in  to  static  images. 

"rasterzen" 


We've  been  working  with  and  contributing  to  a  piece  of  software 
called  gO-rasterzen  https  :  /  /  github .  com/whosonf irst /go- 
rasterzen  that  does  everything  I've  just  described,  transforming 


the  raw  Nextzen  vector  tile  data  in  to  an  SYG  representation  and 
then  finally  a  PNG  image  file. 

https  ://github  .com/whosonfirst/go-r  asterzen 

It  works  like  this: 


•  Instead  of  asking  for  tiles  directly  from 
Nextzen  https://www.nextzen.org  a  map 
application  requests  them  from  a  rasterd  server 
(which  is  part  of  the  go-rasterzen  code). 

•  rasterd  proxies  tile  requests  to  Nextzen  and  then  for 
each  tile  it  receives: 

•  Performs  some  necessary  house-cleaning  producing  an 
intermediate  data  ( vector )  tile  called  the  rasterzen 
layer  and  then: 

•  Transforms  the  rasterzen  tile  in  to  an  SVG  tile  and 
then: 

•  Transforms  the  SVG  tile  in  to  a  PNG  tile  and: 

•  Caches  each  step  and  every  transformation  for  future 


use. 
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Name:  25368. png 
Dimensions:  512  x  512 
MIME  Type:  image/png 


And  just  like  that  we  have  three  new  tiles:  One  tile  we  can  use  with 
complex  rendering  engines  like  Tangram,  one  tile  we  can  use  vector¬ 
editing  tools  (like  Illustrator)  and  one  tile  that's  just  an  image  file. 


All  of  those  tiles  are  cached  on  machines  that  we  control  which 
means  that  if  Nextzen  suddenly  went  away  tomorrow  we'd  have  a 
little  bit  of  breathing  room  to  figure  out  what  to  do  next.  That  feels 
like  progress. 


In  a  world  where  everything  "is  a  service"  and  some  of  those 
services  have  abbreviated  lifespans  (at  best)  it  becomes  especially 
important  to  think  about  how  those  failure  scenarios  will  affect  our 
own  efforts  as  they  depend  increasingly  on  third-party  efforts. 


Known  knowns 


It  is  important  to  note  that  these  are  early  days  for  the  project  so 
there  are  lots  of  things  https://github.com/whosonfirst/go- 
rasterzen#important  that  might  prevent  you  from  using  it  right 
now.  In  no  particular  order  here  is  a  list  of  known-knowns: 

The  SYG  and  PNG  tiles  that  are  created  are  highly  stylized  and  can 
not  be  customized  at  the  moment.  Polygons,  points  and  lines  are  all 
stroked  black  and  polygons  are  assigned  an  arbitrary  (white)  fill  with 
a  10%  opacity.  Otherwise  the  backgrounds  on  all  the  tiles  are 
transparent. 


For  example,  here  is  a  single  map  tile  whose  img  container  has  been 
assigned  a  pink  background  color  (using  CSS)  which  is,  in  turn, 
contained  by  a  div  element  whose  background  color  is  yellow: 


#tile-samples  { 

width : 400px; 
margin:  0  auto; 
background-color:  yellow; 

} 

#tile-samples  img  { 

max-width : 2 OOpx  ! important; 
margin:  25px; 
background-color : pink; 
border :5px  solid  #000; 
display : inline ; 

> 

Did  you  notice  the  thin  border  around  the  edges  of  the  tile?  This  is 
an  artifact  of  the  way  that  ocean  and  coastline  data  in 
OpenStreetMap  is  encoded.  It's  a  known-known  that  all  tile 
rendering  libraries  account  for,  in  their  own  fashion,  but  rasterd 
hasn't  gotten  there  yet. 


Every  Nextzen  tile  includes  a  robust  catalog  of  metadata  that  can  be 
used  for  sorting,  filtering  or  drawing  the  map  data.  Here  is  a  map  of 
SFO  and  the  surrounding  area  where  every  feature,  including  the  bay 
and  the  coastline,  has  been  color-coded  according  to  its  Nextzen 
"kind"  value. 


rasterd  doesn't  take  advantage  of  this  metadata.  Yet. 

When  it  comes  to  PNG  (image)  tiles  the  easiest  way  to  think  about 
them,  right  now,  is  that  they  are  slow  to  produce.  Depending  on  how 
you've  configured  rasterd  this  can  lead  to  tiles  failing  to  render 
because  the  server  gets  exhausted  and  gives  up. 


Even  rendering  SYG  tiles  can  be  prohibitively  slow  depending  on 
the  location.  These  tiles  of  New  York  City  eventually  rendered  but  as 
you  can  see,  in  this  screenshot,  it  took  just  under  90  seconds  for  a 
single  tile  to  render  running  locally  on  my  laptop. 


This  website  is  part  of  SFO  Museum 
actor  [3  Console  D  Debugger  {}  Style  Editor  @  Performance  O  Memory  Net\ 
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rasterd  can  be  run  as  a  standalone  web  server  or  as  an  Amazon 
Web  Services  (AWS)  Lambda 

function  https :  //github .  com/whosonf  irst/ go- 
r aster zen#iambda  .  This  can  be  incredibly  useful  if  you  want  to 
pre-render  all  of  your  tiles  offline,  using  Amazon's  giant  computers 
to  do  all  the  work. 

If  you  want  to  use  rasterd  as  a  Lambda  function  that  is  part  of  a 
user-facing  endpoint  then  you  should  be  aware  that  this  will  involve 
a  few  extra  steps  beyond  the  usual  button-pressing  that  doing 
anything  with  AWS  involves. 


Environment  variables 


You  can  define  Environment  Variables  as  key-value  pairs  that  are  accessible  from  your  function  code.  These  are  useful  to  store  configuration  settings  without  the  need 
to  change  function  code.  Learn  more. 


RASTERD_PROTOCOL 

lambda 

Remove 

RASTERD_S3_CACHE 

true 

Remove 

RASTERD_S3_DSN 

[  Remove 

RASTERD_S3_0PTS 

ACL=public-read 

Remove 

RASTERD  PNG  HANDLER 

[  false 

Remove 

1  Key 

1  Value 

Remove 

►  Encryption  configuration 


Example  configuration  setting  for  running  rasterd  as  an  AWS  Lambda  function. 


•  You'll  need  to  configure  your  Lambda  function  to  run 
behind  an  AWS  API 

Gateway  https  :  /  / docs  .  aws  .  amazon .  com/  lambda/  lat 
est/ dg/with-on-demand-https -example . html 

endpoint  with  a  LAMBDA_  PR  OX  Y  integration. 


•  API  Gateway  has  a  maximum  execution  timeout  of  30 
seconds  ( compared  to  Lambda's  five  minutes)  which 
may  not  be  enough  to  render  a  larger  tile ,  especially  a 
PNG  tile. 


•  If  you  want  your  API  Gateway  endpoint  to  return 
PNG  files  as  actual  images  (and  not  base64  encoded 
blobs  of  text)  then  there  is  even  more  button- 


pressing  https : / / github . com/whosonf irst/ go- 
rasterzen#lambda-api-gateway-and-images  you'll 
need  to  do  when  setting  things  up. 

•  After  all  of  that  if  you  want  to  use  these  PNG  tiles  with 
a  mapping  library like 

Leaflet  https://ieafietjs.com/  you'll  still  need  to 
use  a  custom  library  to  fetch  those  tiles.  You  can  write 
your  own  code  to  that  but  there  is  also  code 
included  https : / / github . com/whosonf irst/ go- 
raster  zen#lambda-api-gateway-and-images-and- 
leaf  let  with  the  go-rasterzen package. 


A  screenshot  from  the  early  days  where  we  realized  we'd  need  an  intermediate  vector  tile  (the  "rasterzen"  layer)  to  crop 
Nextzen  data  that  exceeds  the  geographic  extent  of  a  tile. 


Lastly,  labels.  Have  you  noticed  that  there  are  no  labels  in  any  of  the 

screenshots?  That's  because  go-rasterzen  doesn't  even  try  to 

tackle  labels  right  now  because  labels  are  proper 

hard  https : / /web . archive . org/web/20 180 1022 122 14 /https : / / 

mapzen.com/blog/curved-labels/ 


The  confidence  to  try  something  else 


As  you  can  see  it  really  is  early  days  and  there  are  a  lot  of 
details  https  :  /  / github .  com/whosonf irst/go- 
r aster zen#import ant  to  account  for  but  it  works  and,  most 
importantly,  it  is  something  we  can  build  on  going  forward. 


C  Q. 
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Total:  1  sheet  of  paper 


This  website  is  part  of  SFO  Museum 
@sfomuseum 


aflet  |  ©  OSM  contributors  |  Nextzen 


height:  70vh; 

> 

. leaflet-container. leaf let-touch- 
drag,  leaflet-touch-zoom  { 
-ms-touch-actioo:  none; 

touch-action:  none; 

> 

leaflet. css: 63 

Printable  maps!  From  your  web  browser! 


The  maps  that  rasterd  produces  are  only  one  possible  map  and 
without  question  they  are  more  atmospheric  than  they  are  narrative. 
They  are  certainly  not  maps  to  land  airplanes  by,  for  example. 


They  may  not  even  end  up  being  our  "default"  map  and  I  think  it's 
fair  to  ask  whether  they  are  even  a  "good  enough  is  perfect"  map. 


Maybe  these  maps  are  simply  a  fail-safe,  pre-rendered  and  only  used 
when  nothing  else  works.  Their  value  then  comes  from  giving  us  the 
confidence  to  try  a  more  sophisticated  approach  while  still  having  a 
way  to  get  home  safely,  so  to  speak. 


The  map  as  soundtrack. 


Which  is  a  good  thing.  If  you'd  like  to  try  using  our  rasterzen 
map,  keeping  in  mind  all  of  the  provisos  mentioned  in  this  blog  post, 
you  can  do  so  here: 


https://millsfield.sfomuseum.org/map/ 

2018-07-31 


things  I  have  written  about 
elsewhere  #201 80807 

Old  maps  (and  old  map  tiles)  at  SFO  Museum 


Old  maps  (and  old  map  tiles) 
at  SFO  Museum 


Did  you  notice  the  aerial  overlay  of  the  airport  in  final  image  in  our 
last  blog  post  about  maps  /blog/2018/07/31/maps/  ?  That's 
what  this  blog  post  is  about. 

We've  updated  the 

map  https://millsfield.sfomuseum.org/map  to  allow  you  to 
overlay  historic  aerial  imagery  of  the  airport  for  over  a  dozen  years 
between  1943  and  2004.  We  also  have  a  stack  of  aerial  maps  of  the 
airport  for  every  year  since  2011  that  we're  aiming  to  get  online 
shortly. 


Many  of  these  images  are  accessioned 

photographs  https : //WWW. flysfo.com/museum/ collection/ adv 
anced-search?f ield_type_collection_tid=All&advanced- 
search=aeriai  in  the  museum's  collection.  They  are  also  part  of 
the  catalog  of  data  collected  by  the  airport's  GIS  department  and 
they've  done  the  work  to  "geo-rectify"  these  images. 


That's  the  technical  term  for  mapping  all  the  x ,  y  pixels  in  an 
images  to  their  corresponding  longitude  and  latitude 
coordinates  http://maps.nypi.org/warper  in  physical  space. 
And  that  means  that  we  can  put  these  maps...  on  the 
map  https://www.gdal.org/gdal2tiles.html  ,now! 


httpy/1ocalhost:8080/map/# 1 960/bg/1 4/37.61 82/- 1 22.3844 

1943  1947  1950  I9601*1  1978  1980  1981  1985  1988  1989  1998  2000  2002  2004 


Historic  maps  can  be  displayed  on  top  of  the  global  map  or  beneath 
it  to  help  you  understand  the  "shape"  of  the  airport,  both  literally  and 
figuratively,  at  a  point  in  time  relative  to  the  airport  today. 


We've  made  sure  that  the  year  an  aerial  image  was  taken  is  included 
in  the  map's  "permalink"  so  that  it's  easy  to  link  to  and  share  the  map 
at  a  specific  point  in  time  and  space.  For  example: 

https://millsfield.sfomuseum.Org/map/#1960/14/37.6160/-122.384 

7  https : / /mills field. sfomuseum.org/map/#1960/14/37 .6160/- 
122.3847 

Or: 

https://millsfield.sfomuseum.Org/map/#1960/bg/14/37.6160/-122.3 

847  https  : //millsf ield . sf omuseum. org/map/#l 960 /bg/ 14/37 . 6 
160/-122.3847 

To  indicate  whether  an  historic  map  should  be  placed  in  the 
foreground  or  the  background. 


We've  tried  to  make  sure  that  everything  renders  correctly  and  works 
smoothly  on  mobile  devices,  with  their  tiny  screens. 


Going  forward  it's  likely  that  we'll  replace  the  desktop  list  view  for 
years  with  aerial  imagery  using  a  select  menu.  Our  goal  is  to  have 
more  maps  to  choose  from  than  we  can  reasonably  expect  to  fit  in  a 
standard  text-based  list. 


hnp;//iQcalhost;8Q9Q/map/»i98l/i2/37,662i/-i223928 

1943  1947  1950  1960  1978  1980  1981|xl  1985  1988  1989  1998  2000  2002  2004 


Not  shown  in  any  of  the  screenshots  so  far  are  the  locate  me  and 
a  follow  me  around  buttons  beneath  the  list  of  available  maps. 
These  use  the  geolocation 

functionality  https:  / /developer . mozilla.org/en- 
us /docs /web /api /Geolocation  built  in  to  most  modern  web 
browsers  and  operating  systems  to  locate  your  device  out  there  in  the 
real  world. 


For  example,  the  image  above  shows  where  I've  been  writing  this 
blog  post.  In 


1981  https  : //mil Is field . sf omuseum. org/map/#1981/ 12 / 37.62 
21/-122 .3928  it  was  a  parking  lot. 

If  you  are  thinking  about  clicking  the  locate  me  or  follow  me 
around  buttons  and  wondering  Will  the  museum  (or  the  airport ) 
record  my  location  when  I  do?  the  answer  is:  No.  The  location  data 
collected  by  your  browser  is  only  held  by  your  browser  for  as  long  as 
you  keep  the  /map  webpage  open  or  until  you  click  the  clear  my 
locations  button. 

If  you're  a  person  who  likes  to  look  under  the  hood  and  see  exactly 
what's  going  on  you  can  review  the  source  code  in 
question  /  javascript/sf omuseum. maps  .geolocate.  js  . 


Here  is  where  the  recently  opened  Caticons:  The  Cat  in 
Art  https : / /www. flysfo.com/museum/exhibitions/caticons- 
cat-art  exhibition  is  located  on  the  A  side  of  the  International 
Terminal,  relative  to  the  airport  in 

1943  http  s : / /millsf ield. sf omuseum. org/map/#1943 


Here  is  the  Isamu  Noguchi:  Inside  and 

Out  https : / /www. flysfo.com/museum/ exhibitions/ isamu- 
noguchi-inside-and-out  exhibition,  also  in  the  International 
Terminal,  relative  to  the  airport  in 

1950  http  s : //mil Is field. sf0museum.0rg/map/#l 950/ 15/37 . 61 
55/-122.3898  . 


And  again  in  1943,  with  today's 

airport  https : / /mil Is f ield. sfomuseum. org/map/#1943/bg/15/ 


37.6155/-122.3898  overlayed. 


Do  you  see  the  pink  line  overlayed  on  the  map  in  this  photo? 


That's  a  visualization  of  the  web  browser's  geolocation  engine 
tracking  me  as  I  walk  from  the  International  Terminal  to  the 
connector  between  Terminals  2  and  3,  stopping  at  the  Jason 
Reblando:  New  Deal 


Utopias  https : / /www. flysfo.com/museum/exhibitions/ jason- 
reblando-new-deal-utopias  photography  exhibition. 


In 

1947  https  : //mil Is field . sf omuseum. org/map#1947/ 16/37.620 
3/-122 . 3858  neither  of  these  things  were  possible. 

It  is  important,  though,  to  remember  that  geolocation  in  a  web 
browser  and  especially  indoor  geolocation  is  still  in  its  infancy  so  the 
results  can  range  from  good  to  weird  to  crazy-town. 


Here's  an  example: 


This  screenshot  was  taken  midway  through  the  historic  map  tiles 
loading.  You  can  see  the  early  planning  phases  for  what  is  now 
boarding  area  F,  already  underway  in 


1960  http  s : //mil Is field . sf omuseum. org/map/#1960/ 16/ 37.61 
51/ -122 .3868  ,  and  today's  boarding  area  B  which  wasn't  even  a 
twinkle  in  anyone's  eye  back  then.  It  also  shows  me  walking  from 
the  International  Terminal  to  Terminal  1  and  things  more  or  less 
make  sense. 

The  path  itself  is  a  bit  wonky,  a  reflection  of  the  fact  that  moving 
around  too  much  or  too  quickly  seems  make  the  geolocation  APIs 
lack  confidence  in  deciding  where  you  are.  Conversely  the  squiggly 
lines  between  Terminals  2  and  3  (or  piers  B  and  C,  in  1960)  is  where 
I  stopped  to  take  the  photo  above  and  exchanged  some  text  messages 
with  a  colleague. 


From  Terminal  1  I  walked  up  to  the 

AirTrain  https : / / en .Wikipedia. or g/wiki/AirTrain_( San_Fran 


cisco_internationai_Airport  and  hopped  on  the  red  line  which 
does  a  loop  around  the  terminals.  At  this  point  I  was  out  of  range, 
more  often  than  not,  of  the  airport  wireless  network  and  things 
started  to  get  a  bit  strange. 

As  the  AirTrain  reaches  the  stop  for  Garage  A  there  are  fewer  and 
fewer  receievers  (wireless  or  cellular)  and  as  it  leaves  the  station  I 
am  suddenly  transported  out  on  the  airfield,  at  least  in  the  eyes  of 

the  web 

browser  http: / /WWW. scotusblog.com/2018/ 06 /opinion- 
analysis  -court-holds -that-police-wil 1 -general ly-need-a- 
warrant-f or-cellphone-location-inf ormation/  because  that's 
the  location  of  the  closest  receiver. 


Here's  another  example  traveling  back  to  West  Field  Road  from  the 
G  side  of  the  International  Terminal.  As  you  can  see  Westfield  Road, 


or  something  like  it,  was  already  there  in 

1950  http  s : //mil Is field. sfomuseum. org/map/#l 950/ 15/37 . 61 
48/ -122 .3898  (and  as  even  early  as 

1943  http  s : //mil Is field . sfomuseum. org/map/#l 943/ 15/ 37.61 
48/ -122 .3898  ). 


And  again  thirty  years  later  in,  in 

1981  https://millsfield.sfomuseum.org/map#1981  , before 


Central  Terminal  was  demolished  and  replaced  by  Terminal  2. 


And  finally  in 

2004  https  : //mil Is field. sf omuseum. org/map/#2 004/ 15/37 . 61 
48/-122.3898  ,  after  the  AirTrain  was  built  and  the  new  Intemation 
Terminal  was  opened  when  there  were  two  "boarding  area  A"s,  one 
in  the  Internation  Terminal  and  the  other  in  Terminal  1 . 


The  next  steps  for  this  project  are,  roughly  in  this  order: 

Adjust  and  clean  up  some  of  the  older  maps. 

Aerial  map  imagery  is  a  useful  lens  on  to  the  history  of  satellite 
imagery,  remote  sensing  and  scanning  technologies  over  the  years.  In 
2018,  we  are  used  to  crisp  perfect  imagery  but  that  wasn't  always 


possible  and  you  can  see  the  results  in  some  of  our  older  maps. 
There  are  tools  and  techniques  for  producing  if  not  sharper  images 
then  at  least  images  with  a  higer  contrast  so  we'll  start  to  investigate 
that  soon. 

Add  support  for  offline- viewing. 

These  maps  need  to  connect  to  the  internet  at  least  once  to  fetch  data 
but  after  that  they  should  "just 

Work"  https : / /f berriman . com/ 2  017/ 0  6/2  6/naming- 
progressive-web-apps/  whether  or  not  you  are  still  online.  The 
same  is  true  about  anything  the  museum  does  online,  really,  so  we're 
going  to  use  the  map  as  a  way  to  work  out  the 
details  https : / / abookapart . com/ products/ going-offline 
going  forward. 

Fill  in  the  gaps 

Ideally  we  would  like  historic  aerial  imagery  for  every  year  since 
1927  when  the  city  first  leased  the  land  the  airport  now  occupies. 
We'd  also  like  aerial  imagery  for  the  surrounding  Bay  Area  since 
SFO  exists  as  a  part  of  a  larger  community  and  those  stories  have  a 
relationship  with  our  story. 


Rinse  and  repeat 


Finally,  once  we  have  worked  out  all  the  kinks  and  understand  how 
things  work,  we'd  like  to  do  the  same  for  all  the  other  airports  that 
SFO  holds  hands  with.  Everybody  who  passes  through  SFO  is  either 
going  to  or  coming  from  another  airport  and  so  by  definition  their 
histories  are  entwined  with  ours. 

Until  then  we  invite  you  to  enjoy  the  old  and  new  maps  that  we've 
put  online  so  far  at: 

https :  //millsfield  .sfomuseum  .or  g/map/ 

2018-08-07 
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Who's  On  First  at  SFO  Museum 


Who's  On  First  at  SFO  Museum 


This  was  originally  published  on  the  SFO  Museum  Mills  Field 

weblog  https://millsfield.sfomuseum.org/blog/2018/08/28/whosonfirst/  ,  in  July  2018. 


Today,  we  are  happy  to  announce  the  first  release  of  historical  building  footprints 
and  interior  spaces,  including  galleries  and  public  art,  at  the  San  Francisco 
International  Airport  (SFO)  as  an  openly  licensed  dataset. 


The  data  spans  the  years  1954  through  2018  and  is  published  under  the  Linux 
Foundation's  Community  Data  License  Agreement  -  Permissive 
1.0  https://cdia.io/permissive-i-o/  (CDLA).  In  the  coming  weeks  and 
months  we  will  both  improve  and  update  the  data,  adding  similar  records  for  the 
years  preceding  1954. 


We  are  publishing  this  data  in  five  groupings.  Three  of  them  contain  spatial  data  and 
descriptive  properties  about  architectural  elements  at  the  airport  and  their 
surroundings.  They  are:  Architectural  features,  Public  art  and  something  called 
"Who's  on  First"  (which  is  described  below).  The  metadata  groupings  contain 
placetypes  and  properties  (in  the  data)  that  are  specific  to  SFO  and  SFO  Museum. 


These  data  are  not  an  "application",  but  rather  the  building  blocks  by  with  which 
many  different  applications  might  be  built.  Importantly  they  are  the  building  blocks 
that  allow  a  variety  of  different  applications  to  be  built  while  still  remaining 
interoperable,  providing  a  common  framework  for  everyone  to  talk  about  the  airport 
and  the  museum  and  our  collection  the  same  way. 

Architecture 

This  data  is  published  in  raw  form  on  GitHub:  https://github.com/sfomuseum- 
data/sfomuseum-data-architecture  https :  //github . com/sfomuseum- 
data/sfomuseum-data-architecture  .  It  is  also  available  as  a  SQLite  database 
from: 

https://millsfield.sfomuseum.org/distributions/sqlite/  https :  //milisf ieid. sfomu 

seum.org/ distributions/sqlite/sfomuseum-data-architecture- 


latest . db . bz2 


Architecture  data  consists  of  the  following  distinct  placetypes:  buildings,  terminals, 
boarding  areas,  gates  and  galleries. 

Buildings 


As  of  this  writing  there  is  only  "one"  building  included  in  the  dataset:  The  SFO 
terminal  complex.  SFO  has  changed  and  evolved  many  times  over  the  years  and  the 
data  contains  an  atomic  record  for  each  instance,  with  pointers  to  "the  SFO"  that  it 
replaced  or  was  replaced  by. 


The  SFO  of  today  won't  be  the  SFO  of  tomorrow  and  we  believe  that  it's  important 
to  be  able  to  unambiguously  refer  to  a  particular  incarnation  of  the  airport  at  a 
moment  in  time  with  the  confidence  that  the  details  won't  change.  Trying  to  speak 
about  SFO  in  the  1970s  is  tricky  when  your  only  referent  is  the  SFO  of  2018. 


There  are  lots  of  other  buildings  at  the  airport.  If  you've  visited  SFO  recently  you 
may  have  noticed  that  there's  a  new  hotel  under 


construction  https : //blog . sfgate.com/ cmcginnis/20 17/06/22 / f ly-thru- 
sfos -new-on- airport-hotel/  facing  the  A  terminal,  in  almost  the  same  spot 
where  the  old  Hilton  Hotel  https  :  //WWW.  flysfo.com/museum/aviation- 
museum- library/ collection  / 152 17  used  to  stand  in  the  1960s.  We  plan  to  add 
both  of  those  hotels  and  all  the  other  buildings  that  have  defined  the  airport 
complex,  over  the  years,  but  the  terminal  building  itself  seems  like  the  best  place  to 
start. 

You  can  see  all  the  buildings  at: 

https://millsfield.sfomuseum.org/architecture/buildings/  https : //milisfieid. 

sfomuseum.org/ architecture/buildings/ 

Terminals 


Responsive  Design  Mode  No  throttling  i  DPR:  2  s  Reload  when...  ^  ^  X 

320  x  480 

no  device  selected  s 

Mills  Field 


Terminal  1  (2017-) 


It  is  a  terminal  ( sometimes 
called  a  wing)  and  is 
parented  by  SFO  Terminal 
Complex  (2017-). 


In  1954  there  was  a  single  terminal  at  SFO.  In  1963  there  were  two.  By  1979 
another  terminal  had  been  added  and  in  2018  there  are  four  terminals,  and  every  one 
of  them  has  been  altered  or  reconfigured  in  some  way  over  the  years.  As  with 
buildings  we've  created  stable  and  permanent  identifiers  for  each  terminal  as  it  was 
built,  or  experienced  a  significant  change,  with  a  pointer  to  which  SFO  (the 
building)  it  was  part  of. 


You  can  see  all  the  terminals  at: 

https://millsfield.sfomuseum.org/architecture/terminals/  https : //millsf ield. 

sfomuseum.org/architecture/terminals/ 


Boarding  Areas 


Things  that  are  a  "boardingarea" 


rhere  are  75  of  them  /  see  all 

/ 

Boardinq  Area  A  (2017-).  which  is 

part  of  International  Terminal 

the  placetypes 

Boardinq  Area  B  (2017-),  which  is 

part  of  Terminal  1 

Boardinq  Area  C  (2017-).  which  is 

part  of  Terminal  1 

pA 

Boarding  Area  D  (2017-).  which  is 

part  of  Terminal  2 

Boardinq  Area  E  (2017-).  which  is 

part  of  Terminal  3 

Boardinq  Area  F  (2017-).  which  is 

part  of  Terminal  3 

Boarding  areas  are  to  terminals  as  terminals  are  to  buildings.  They  have  expanded 
and  contracted  and  even  swapped  names  with  one  another  since  1954.  For  a  short 
time  there  were  two  "Boarding  Area  A"s,  one  domestic  and  one  international.  As 
with  the  terminals ,  each  boarding  area  has  a  pointer  to  its  relative  hierarchy 
(terminal,  building)  at  that  point  in  history. 


You  can  see  all  the  boarding  areas  at: 

https://millsfield.sfomuseum.org/architecture/boardingareas/  https :  //milisf 

ield. sfomuseum.org/architecture/boardingareas/ 


Galleries 


Mills  Field 


Mp;//l.oca.!bo.sr8P8fl/rn^/M98Yl7/3.7.g1.6l6/il.2gJ.3Bg.89 
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Early  work  plotting  SFO  Museum  galleries  on  our  historic  map  imagery.  Not  seen  is  the  recently  opened  video  arts  gallery  but  it's  definitely  there! 


The  gallery  spaces  are  where  the  airport  and  the  museum  meet! 

In  2018  there  are  20  active  gallery  spaces  at  the  airport  but  since  the  museum  first 
opened  in  1980  there  have  been  over  50  such  spaces. 


We've  documented  all  of 

them  https://millsfield.sfomuseum.org/placetypes/gallery  and 
associated  each  one  with  its  relevant  boarding  area,  terminal  and  building. 
Contemporary  gallery  spaces  include  precise  location  information  while  older 


galleries  include  approximate  data,  usually  the  centroid  of  its  container  boarding 
area  or  terminal. 


If  you're  starting  to  see  a  pattern  in  the  relationship  between  the  different  placetypes 
you  might  be  able  to  guess  what  comes  after ,  or  is  " contained "  by,  the  galleries. 
We're  saving  that  for  a  future  release  but  it's  definitely  on  the  way... 


You  can  see  all  the  galleries  at: 

https://millsfleld.sfomuseum.org/architecture/galleries/  https : //millsf ield. s 

fomuseum.org/architecture/galleries/ 


Gates 


Only  2018-era  gates  (and  relationships)  are  available  in  this  release.  As  time  and 
circumstances  permit  we  will  add  historical  gate  information. 


Additionally,  the  location  information  for  these  gates  in  this  data  release  reference 
the  Federal  Aviation  Administration  https :  lit aa .  gov  's  (FAA)  idea  of  where 
the  gate  is  which  is  not  where  most  people  think  a  gate  is  located.  The  FAA 
considers  the  location  of  a  gate  to  be  the  point  where  a  jetway  meets  the  door  of  an 
airplane.  Passengers  tend  to  think  that  that  a  gate  is  located  at  the  other  end  of  the 
jetway,  where  it  meets  the  airport  itself. 


Don't  worry,  though:  The  airport  (SFO)  knows  where  all  of  these  "different"  gates 
are  located  and  that  data  will  be  included  in  a  future  update  to  the  data  that  SFO 
Museum  publishes. 


When  you  think  about  the  jetways  they  themselves  are  their  own  kind  of  unique 
architectural  space  at  an  airport.  It  seems  a  little  silly  to  try  and  model  them  along 
the  same  lines  as  terminals  or  boarding  areas,  if  only  because  they  move  around  so 
much.  That  said  once  we  have  stable  coordinate  data  for  either  end  of  a  jetway  there 
is  a  larger  avenue  to  imagine,  or  visualize,  these  otherwise  liminal 
spaces  https : / /www. sciencedirect.com/science/article/abs/pii/ S01607383 
18300094  . 

You  can  see  all  the  gates  at: 

https://millsfield.sfomuseum.org/architecture/gates/  https : //milisfieid. sfom 
useum.org/architecture/ gates/ 


Public  Art 


All  of  the  SFO  Museum  public  art  Geo J SON  data  converted  in  to  an  ESRI  Shapefile  using  the  go-whosonfirst-shapefile  tool  and  then  converted 
back  in  to  GeoJSON  using  the  ogr2ogr  tool  and  then  displayed  on  the  geojson.io  website...  phew! 


This  data  is  published  in  raw  form  on  GitHub:  https://github.com/sfomuseum- 
data/sfomuseum-data-publicart  https  :  //github .  com/sfomuseum- 
data/sfomuseum-data-pubiicart  .  It  will  be  available  as  a  SQLite  database 
shortly. 

SFO  Museum  has  over  100  works  of  public  art,  located  throughout  the  airport, 

purchased  through  the  San  Francisco  Arts 

Commission  https://sfartscommission.org/  (SFAC). 


The  public  art  dataset  is  a  lens  on  to  the  world  of  open  data  publishing.  The  data 
was  sourced  from  records  published  in  the  Who's  On  First  California  venues 
dataset  https : //github . com/whosonf irst-data/whosonf irst-data-venue-us- 
ca/  .  That  data  was  imported  in  to  Who's  On  First  (WOF)  from  the  DataSF  open 
data  portal  SF  Civic  Art  Collection  https : //data. sfgov.org/cuiture-and- 
Recreation/SF-civic-Art-Coiiection/zfw6-95su  data  file.  SFO  Museum  has 
taken  the  WOF  data  and  updated  it  with  accurate  location  data  and  museum/airport 


specific  metadata.  We  will  contribute  our  changes  back  to  the  WOF  venue  dataset, 
because  we  want  these  works  to  be  shared  far  and  wide,  but  plan  for  our  public  art 
dataset  https : //github . com/sfomuseum-data/sfomuseum-data-publicart  to 
be  the  "source  of  truth"  from  now  on. 


You  might  be  reading  this  and  thinking  "What  is  this  thing  called  Who's  On  First 
that  keeps  being  mentioned?"  All  your  questions  are  answered  in  the  following 
sections. 


You  can  see  all  of  the  public  art  at: 

https ://millsfield .sfomuseum.org/publicart/  https://miiisfieid.sf omuseum . o 
rg/publicart/ 


The  rest  of  the  world  (aka  "Who's  On  First") 


Postal  Codes 

There  is  one  thing  that  is  a  postal  code  /  see  all  the  placetypes 


94128 ,  which  is  part  of  San  Mateo 


This  data  is  published  in  raw  form  on  GitHub:  https://github.com/sfomuseum- 
data/sfomuseum-data-whosonfirst  https  :  //github.com/sfomuseum- 
data/sfomuseum-data-whosonfirst  .  It  is  also  available  as  a  SQLite  database 
from: 

https://millsfield.sfomuseum.org/distributions/sqlite/  https :  //milisf ieid. sfomu 

seum.org/ distributions/sqlite/sfomuseum-data-whosonf irst- 
latest . db . bz2 


All  of  SFO  Museum's  datasets  hold  hands  with  the  Who's  On 
First  https://whosonfirst.org/  gazetteer. 


If  you  are  not  familiar  with  the  term  "gazetteer"  it's  basically  just  a  giant  phonebook 
of  places,  rather  than  people.  Instead  of  creating  a  museum-specific  record  for  the 
city  of  San  Francisco  https  :  /  /milisf  ield.  sfomuseum.org/ id/ 85 92 2 58 3/  or 
the  state  of  California  https : //milisf ield. sfomuseum.org/ id/85688637/ 
and  so  on,  we've  chosen  to  use  the  Who's  On  First  records  for  those  places. 

The  entire  Who's  On  First  dataset  contains  26  million 

records  https : //spelunker .whosonf irst . org/placetypes/  ,  of  which  there 
are  over  700,000  administrative  places  https :  //github.com/whosonfirst- 
data/whosonfirst-data/  .  The  s  f  omuseum-data-whosonf  irst  dataset  is 
a  repackaging  of  only  those  records  that  the  SFO  Museum  datasets  need  to  make 
sense.  Currently  that's  limited  to  a  handful  of  places  in  San 
Francisco  https://millsfield.sfomuseum.org/id/85922583/  , 

California  https://millsfield.sfomuseum.org/id/85688637/  and  the 
United  States  https : //milisf ield. sfomuseum.org/ id/85633793/  but  more 
places  will  be  added  as  the  overall  work  we're  doing  progresses. 


This  dataset  bundles  the  following  placetypes: 

Campuses  https : //millsfield. sfomuseum.org/placetypes/campus/  ,  postal 


codes  https://millsfield.sfomuseum.org/placetypes/postalcode/  , 
localities  https : //mills field. sfomuseum.org/placetypes/locality/  , 
Counties  https://millsfield.sfomuseum.org/placetypes/county/  , 
regions  https://millsfield.sfomuseum.org/placetypes/region/  and 
Countries  https : //millsfield. sfomuseum.org/placetypes/country/  . 

sfomuseum-properties  and  sfomuseum-placetypes 


These  data  are  published  on  GitHub  from: 

https://github.com/sfomuseum/sfomuseum- 

properties  https : //github.com/sfomuseum-data/sfomuseum-properties  and 
https://github.com/sfomuseum/sfomuseum- 

placetypes  https : //github .com/sfomuseum-data/sfomuseum-placetypes 
respectively. 


All  of  the  datasets  are  described  with  a  common  set  of  metadata  properties  as  well 
as  any  additional  properties  unique  to  the  record  in  question  or  the  source  that 
published  it. 


These  properties  are  denoted  by  using  a  {NAMESPACE}  :  {PREDICATE }  = 
{VALUE}  syntax.  For  example:  sfomuseum:placetype=terminal  or 
s  f  omu  s  eum : map_i d= D 1 2 . 


The  sfomuseum-properties  https  :  //github .  com/sfomuseum-data/sfomuseum- 
properties  data  is  where  all  the  properties  whose  namespace  is  sfomuseum: 
are  described  as  machine-readable  documents  that  can  be  used  to  generate  human- 
friendly  documentation  or  to  validate  data  using  software. 

The  sfomuseum-placetypes  https  :  //github. com/sfomuseum-data/sfomuseum- 
piacetypes  data  is  where  all  the  SFO  Museum  -designated  placetypes  (buildings, 
terminals,  etc.)  are  described  as  machine-readable  documents  that  can  be  used  to 
generate  human-friendly  documentation  or  to  validate  data  using  software. 

These  data  compliment  the  whosonfirst- 

properties  https : // github.com/whosonfirst/whosonfirst-properties  and 
whosonfirst-placetypes  https  :  // github .  com/whosonf  irst/whosonf  irst- 
piacetypes  data  respectively. 


"Who's  On  First" 


Boarding  Area  B  (2017-) 


Boarding  Area  B  is  a  boarding  area  (sometimes  called  a  concourse)  and  is  parented  by 
Terminal  1  (2017-)  which  is  a  terminal.  Its  permanent  ID  is  1159396217. 


These  SFO  Museum  metadata  are  relevant  because  if  you  look  at  the  record  for,  say, 

Boarding  Area  B 

(2017)  https  :  /  /mil  Is  field,  sfomuseum.org/id/1159396217/  you'll  see  that  it 
is  described  as  " a  boarding  area  (sometimes  called  a 
concourse ) " . 

Inspecting  at  the  raw 

data  https : //mills field. sfomuseum. org/data/1 15/939/62 1/7/ 11593962 17 . g 
eo  j  son  for  that  record  reveals  the  following  properties: 


"properties":  { 

"sfomuseum: pi ace type" : "boardingarea" , 
"wof :placetype" : "concourse" 

> 


So,  which  is  it?  The  short  answer  is  "both"  which  is  an  unsatisfying  answer,  not  to 
mention  a  little  confusing. 


The  longer  answer  is  that  every  SFO  Museum  data  record  is  also  a  Who's  On 
First  https://www.whosonfirst.org  (WOF)  record,  so  in  the  eyes  of 
WOF  https://github.com/whosonfirst/whosonfirst-placetypes  the 
boarding  area  is  in  fact  a 

"concourse  https : //mil Is field. sfomuseum.org/placetypes/concourse/  " 

The  long  answer  is  to  say,  more  precisely,  that  all  the  SFO  Museum  data  is 
compatible  and  holds  hands  with  WOF  data  but  is  not  part  of  the  core  WOF 
datasets. 


Mills  Field 

Placetypes 

These  are  not  all  the  placetypes  in  the  world,  just  the  ones  that  SFO  Museum  is 
concerned  with  right  now.  Some  placetypes  are  the  same  or  similar  but  have  different 
names  (for  example  concourse  and  boarding  area)  while  others  are  museum  specific 
(public  art  or  gates )  but  share  a  common  name  that  encompasses  both  of  them  (venue). 

This  list  and  the  number  of  items  associated  with  each  placetype  will  grow  over  time. 

Venues  and  Public  Art  and  Gates  and  Concourses  and  Boarding  Areas  and  Galleries 
and  Enclosures  and  Wings  and  Terminals  and  Buildings  and  Ticketing  Areas  and 
Counties  and  Regions  and  Postal  Codes  and  Countries  and  Campuses  and  Airports  and 
Localities 


In  order  to  play  nicely  with  WOF  the  record  for  Boarding  Area 
A  https : //mi 11s field. sfomuseum. org/ id/ 1 1593962 17/  needs  to  assert  that 
its  WOF  placetype  is  a  "concourse".  In  order  for  that  same  data  to  make  sense  in  an 
SFO  Museum  context  it  also  states  that  its  SFO  Museum  placetype  is  a  "boarding 
area  https : //mills field. sfomuseum.org/placetypes/boardingarea/  ". 

I've  mentioned  Who's  On  First  https://www.whosonfirst.org  (WOF)  a 
couple  times  already  and  said  earlier  that  it  is  "a  giant  phonebook  of  places,  rather 


than  people" .  An  abbreviated  version  of  WOF's  own  "What  is  Who's  On 
First?"  https  : //www. whosonf irst . org/what/  page  States: 


•  It  is  an  openly  licensed  dataset.  At  its  most  restrictive,  data  is 
published  under  a  Creative  Commons  By-Attribution  (soon  to 
be 

CDLA  https : //whosonfirst.org/blog/2018/08/21/licens 
e-change/  )  license. 

•  Every  record  in  Who’s  On  First  has  a  stable  permanent  and 
unique  numeric  identifier.  There  are  no  semantics  encoded  in 
the  IDs. 

•  At  rest,  each  record  is  stored  as  a  plain-text  GeoJSON file. 

•  Files  are  stored  in  a  nested  hierarchy  of  directories  derived 
from  their  IDs. 

•  There  are  a  common  set  of  properties  applied  to  all  records 
which  may  be  supplemented  by  an  arbitrary  number  of 
additional  properties  specific  to  that  place. 

•  There  are  a  finite  number  of  place  types  in  Who’s  On  First  and 
all  records  share  a  common  set  of  ancestors .  As  with  properties, 
any  given  record  may  have  as  complex  a  hierarchy  as  the 
circumstances  demand  but  there  is  a  shared  baseline  hierarchy 
across  the  entire  dataset. 

•  Individual  records  may  have  multiple  geometries  or  multiple 
hierarchies  and  sometimes  both. 


•  Records  may  be  updated  or  superseded,  cessated  or  even 
deprecated.  Once  a  record  is  created  though  it  can  never  be 
removed  or  replaced. 


Earlier  this  year,  in  a  blog  post  about  maps  biog/20i8/07/3i/maps/  we  said: 


Location  and  place,  as  you  might  imagine,  are  core  to  an  airport.  They 
are  essential  to  any  museum  and  museum  collection,  really,  but  for  the 
sake  of  this  blog  post  we’ll  just  say  that  location  and  place  are  core  to 
our  museum. 


With  that  in  mind  we  have  chosen  to  use  Who's  On 

First  https :  //whosonfirst . org/  (WOF)  to  describe  all  the  places,  out  there  in 
the  world,  that  intersect  with  the  airport,  the  museum  and  our  collection.  We  have 
also  chosen  to  use  WOF  to  model  things  inside  the  airport,  the  museum  and  our 
collection.  The  data  model  used  by  WOF,  its  flexibility  in  accomodating  metadata 
unique  to  SFO  Museum  and  the  large  body  of  tools  for  working  with  WOF 
data  https://github.com/whosonfirst/whosonfirst  specifically  and 
GeoJSON  http://geojson.org/  more  generally  make  it  an  attractive  choice. 


Full  disclosure:  I  have  been,  and  continue  to  be,  intimately  involved  with  the 
Who's  On  First  project  https://www.whosonfirst.org/biog/  since  its 
creation.  In  order  to  explain  why  I  think  WOF  is  relevant  in  a  museum  setting  I  need 
to  take  a  brief  detour  and  tell  a  short  Cooper 
Hewitt  https://labs.cooperhewitt.org/  Story: 


SEARCH  THE  COLLECTION  Q,  MENU = 


COUNTRIES 


France 

We  have  24,961  objects  (about  11.587o  of  our  online  collection)  and  know 
about  193  people  from  this  country. 


There  are  84  countries  represented  in  our  online 
collection  spanning  78,373  objects,  which  means 
that  about  36.367o  of  all  our  stuff  has  a  country  to 
call  home. 


United  States 

We  have  23,105  objects  (about  10.727o  of  our  online  collection)  and  know 
about  374  people  from  this  country. 

Italy 

We  have  14,423  objects  (about  6.69%  of  our  online  collection)  and  know 
about  127  people  from  this  country. 

United  Kingdom 

We  have  3,764  objects  (about  1.757o  of  our  online  collection)  and  know 
about  149  people  from  this  country. 


A  note  about  the  countries  listed  here:  These  are 
only  those  countries  for  which  we  were  able  to 
successfully  match  an  object's  stated  location 
against  the  Where  on  Earth  (WOE)  gazetteer. 
Eventually  every  object  will  have  a  unique  gazetteer 
ID  (whether  it's  WOE  or  the  Getty's  thesaurus  of 
named  places  or  some  other  source)  but  that's  not 
the  case  at  the  moment,  partly  because  in  the  push 
to  release  the  beta  version  of  the  website  we've 
opted  for  things  that  could  be  solved 
programatically  with  a  high  degree  of  confidence 
and  left  things  that  need  extra  checking  to  identify, 
whether  by  humans  or  machines,  for  later. 


Way  back  in  2012  when  the  Cooper  Hewitt  collection 

website  https://coiiection.cooperhewitt.org/  was  being  built  we  took  the 
limited  geographic  information  (countries)  associates  with  each  object  and 
"geocoded"  it. 

Geocoding  is  the  name  used  to  describe  the  process  of  taking  a  name  or  an  address 
for  a  place  and  associating  it  with  a  latitude  and  longitude  or  some  other  fixed, 
stable  identifier.  Geocoding  is  the  art  of  translating  fuzzy,  imprecise  names  or 
labels  https : //web . archive . org/web/20 180 1022 02 736 /https : //mapzen . com/b 
log/geocoding-places/  in  to  something  that  a  computer  can  refer  to 
unambiguously. 

We  used  the  Flickr  geocoding 

API  https : //www. flickr . com/services/api/ flickr .places . find.html  to 

convert  place  names  in  to  "Where  On 

Earth  https://en.wikipedia.org/wiki/GeoPlanet  "  (WOE)  IDs.  For 
example,  the  WOE  ID  for  Germany  is  23424829.  One  reason  these  IDs  are  useful 
is  that  they  allow  you  to  "list  all  the  objects  from  Germany"  without  having  to  worry 
about  spelling  ("Germany"  or  "germany"  or  "Deutschland"  or  "Jermaniya"  or 


"Llscjjinciluayiju"  or  and  so  on...)  or  the  need  to  perform  a  potentially 

expensive  spatial  query.  Instead,  the  problem  of  finding  objects  associated  with  a 
specific  place  is  reduced  to  a  simple  key-value  query  where  the  value  is  a  number, 
and  computers  and  databases  are  really  fast  at  finding  things  with  numeric  indices. 


Q.  Photos,  people,  or  groups 


San  Francisco  International  Airport 


Flickr  groups  about  San  Francisco  International  Airport  (show  more) 
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Fun  fact:  12521  721  is  the  WOE  ID  for  SFO  and  you  can  see  all  the  geotagged  Flickr  photos  taken  at  SFO  by  visiting 
https://flickr.com/places/SFO/  or  https: llflickr.com/ places! 12521 721/. 


Now  you  can  see  all  the  objects  in  the  Cooper  Hewitt  collection  grouped  by 
Country  http://collection.cooperhewitt.org/countries/  .  So  far,  SO  good. 
Except  that  about  a  week  later  one  of  the  curators  came  down  to  the  digital  offices 
and  said:  "You  know  that  object  you  say  is  from 


Germany  https : //collection . cooperhewitt.org/ objects /countries/ 2 34 2 
4  82  9/  ?  Well ,  it  was  actually 

Poland  https : //collection . cooperhewitt.org/objects/countries/234249 
2  3/  at  the  time..."  It  might  have  actually  been  the  other  way  around.  I  honestly 
don't  remember.  Fundementally,  the  issue  is  the 

Same  https : //en .wikipedia . org/wiki/Germany%E2%80%93Poland_border 
though. 

Where  On  Earth  https://en.wikipedia.org/wiki/GeoPlanet  (WOE)  is  the 
gazetteer  used  by  Flickr  and  a  precursor  to  Who's  On  First  (WOF).  One  of  the 
important  differences  between  the  two  gazetteers  is  support  for  historical  places. 
WOE  is  a  "linear"  gazeteer  that  only  focuses  on  the  present  moment.  WOF  on  the 
other  hand  starts  with  the  notion  that  while  places  change  over  time  there  are  real 
and  important  use-cases  to  unambiguously  refer  to  that  place  at  a  moment  in  time. 


The  metadata  for  the  offending  object  from  Germany  was  never  updated  because 
WOE  doesn't  know  about  19th  century  Poland.  This  situation  can  only  be  described 
as  "not  great"  when  it  comes  to  museum  collections. 

Historical  data 

Here's  another  example  taken  from  Stephen  Epps'  blog  post  Tackling  Space  and 
Time  in  Who's  On 

First  https : //whosonfirst . org/blog/2 017/06/2 9/tackling-space-and-time- 
in-whosonfirst/  .  This  animation  depicts  the  three  distinct  instances  of  the 
former  "Yugoslavia"  during  the  20th  century  and  its  evolution  from  one  nation 
state  to  Seven  https : //spelunker .whosonfirst. org/search/ ?q=yugoslavia  , 

during  the  mid  1990s  to  the  present. 


There  are  two  important  conceptual  devices  at  play  here: 

Phase  shifts 

The  first  device  is  the  notion  of  a  "phase  shift" .  A  phase  shift  describes  when  a 
given  place  ceases  to  exist  or  is  replaced  by  a  new  place.  For  example,  when  the 

Kingdom  of 

Yugoslavia  https : //spelunker .whosonfirst.org/id/1108955789/  became 

the  Socialist  Federal  Republic  of 

Yugoslavia  https://spelunker.whosonfirst.org/id/1126113567/  in  1954. 


It's  not  always  clear  what  constitutes  a  "phase  shift"  or  why  and  that's  really  a  much 
larger  philosophical  question  than  we  have  time  to  address  here.  What  is  important 
for  our  purposes  is  describing  the  relationships  that  make  up  a  phase  shift. 


WOF  does  this  by  ensuring  that  every  record  has  a  wof :  supersedes  and  a 
wof :  supers eded_by  property,  with  pointers  to  the  other  constituent  records  in 
a  phase  shift. 

For  example,  between  1945  and  1991  the  Socialist  Federal  Republic  of 
Yugoslavia  https : //spelunker .whosonfirst.org/id/1126113567/ 
superseded  the  Kingdom  of 

Yugoslavia  https : //spelunker .whosonfirst.org/id/1108955789/  but  was 
then  superseded  by 

Croatia  https://spelunker.whosonfirst.org/id/85633229/  , 

Slovenia  https : //spelunker . whosonf irst . org/id/85633779/  and  a  now- 
smaller  version  https  :  //spelunker  .whosonfirst.org/id/1108955787/  of  its 
former  self: 


"properties":  { 

"wof : supersedes " :  [ 
1108955789 

]# 

"wof : super seded_by" :  [ 
1108955787 , 
85633779, 
85633229 


} 


//  Kingdom  of  Yugoslavia  (1918-45) 


//  Socialist  Federal  Republic  of  Yugoslavia  (1991-06-  - 
//  Slovenia  ( 1991-06-/1992-01-) 

//  Croatia  ( 1991-05-/1991-06- ) 


Another  example  might  be  the  "phase  shift"  that 

SFO  https : //mil Is field. sfomuseum.org/ id/1 19396331/  underwent  in  1979 
with  the  addition  of  North 

Terminal  https : //mil Is field. sfomuseum.org/ id/ 115 9 3 96 145/  and  then 
again  in  1981  with  the  addition  of  Boarding  Area 
E  https://millsfield.sfomuseum.org/id/1159396179/  : 


"properties":  { 

"wof : supersedes " :  [  1159396339  ], 


//  "SFO  (1979-81)"  supersedes  "SFO  (1974-79)" 


"wof : super seded_by" :  [  1159396327  ] 


//  "SFO  (1979-81)"  is  superseded  by  "SFO  (1981-83) 


> 

The  phase  shifts  described  in  our  data  only  track  major  changes  to  the  buildings  or 
the  terminals  that  passengers  might  experience.  The  opening  or  closing  of  a 
boarding  area,  for  instance,  but  not  behind-the-scenes  changes  like  modifications  to 
the  baggage-handling  infrastructure. 

The  "Extended  DateTime  Format"  (EDTF) 


Early  work  at  mapping  out  and  visualizing  the  relationships  between  all  the  terminals  and  boarding  areas  and  " instances "  of  SFO. 


The  second  conceptual  device  is  the  use  of  the  Library  of  Congress'  Extended 
DateTime  Format  https  :  //www.  loc .  gov/standards/datetime/pre- 
submission .  html  (EDTF)  to  describe  dates.  EDTF's  strength  lies  in  its  rich 
syntax  https : //www. loc . gov/standards/datetime/pre- 
submission .  htmi#uncertain  to  describe  imprecise,  uncertain  and  sometimes 
disputed  dates.  Did  you  notice  the  funny  squiggles  (~)  at  the  end  of  some  dates  in 
the  examples  above?  The  EDTF  specification  states  that: 


The  characters  '?'  and  are  used  to  mean  "uncertain"  and 
"approximate"  respectively,  and  in  combination,  i.e. to  mean 
"uncertain"  as  well  as  "approximate". 


Sometimes,  as  in  the  case  of  the  former  Yugoslavia  there  is  active  debate  about 
when  a  place  began  or  ceased  to  exist.  Did  Slovenia  become  newly  independent  in 
June  1991  or  January  1992?  People  have  and  will  continue  to  debate  those  details 
and  sometimes  simply  noting  the  the  dispute  is  more  useful  than  being  another  voice 
trying  to  arbitrate  the  solution. 


"properties":  { 

"edtf rcessation" :  "open",  //  "open"  means  current  or  on-going  in  EDI 

"edtf : inception" :  "1991-06-/1992-01-",  //  sometime  around  the  dates  between  June 

} 

Other  times,  circumstances  don't  permit  an  exact  answer.  When  was  Boarding  Area 
C  rebuilt?  We  know  that  it  was  sometime  after 

1983  https : //mi 11s field. sfomuseum.org/ id/ 1159554 835/  but  also  before 
1988  /id/115/955/484/9/  . 

Most  data,  but  especially  cultural  heritage  data,  reflect  the  art  of  managing  imperfect 
or  absent  data  and  EDTF  allows  us  to  soften  the  sharp  edges  of  that  reality  while 
saying  something  about  a  subject. 


Ambiguity  and  absence 


Approximate  geometries 

You  may  have  noticed  that  some  of  the  geometries,  like  the  dates,  for  the  SFO 
architecture  data  are  rough  and  approximate .  I  like  to  refer  to  these  geometries  as 
"cardboard  cutout"  geometries. 


We  will  update  these  geometries  with  accurate  tracings  in  time  (especially  now  that 
we  have  historic  maps  biog/20i8/08/07/oid-maps/  )  but  until  we  then  we  are 
publishing  them  as  a  "good  enough  is  perfect",  or  at  least  "something  is  better  than 
nothing",  solution. 


"properties":  { 

"mz : is_approximate" :  1 

> 

Imperfect  geometries  are  denoted  by  the  mz  :  is_approximate :  1 
property  https : //github .com/whosonfirst/whosonf irst- 
properties/tree/master/properties/mz#is_approximate  and  you  should 
adjust  your  use  of  the  data  accordingly.  The  easiest  way  to  think  about  approximate 
geometries  is  to  understand  that  they  are  more  "there"  than  not. 


For  example  the  old  Boarding  Area 

A  https : //millsfield. sfomuseum.org/id/1159554845/  is  guaranteed  to  be 
located  inside  the  SFO 

Campus  https : //millsfield. sfomuseum.org/ id/ 10252 7513/  and  not,  say, 
across  the  highway  in  San 

Bruno  https : //spelunker . whosonfirst .org/ id/ 85 922 44 1/  but  it  might  not 
be  exactly  there. 


Boarding  Area  A  (1988-  -  2000- ) 


Boarding  Area  A  is  a  boarding  area  (sometimes  called  a  concourse)  and  is  parented  by 
South  Terminal  (1988~  -  2000-)  which  is  a  terminal.  Its  permanent  ID  is  1159554845. 


"Null  Terminal"' 

Some  records  in  the  data  we've  published  belong  to  something  called  "Null 
Terminal".  Null 

Terminal  https://millsfield.sfomuseum.org/id/1159160869/  is  a 
MacGuffin  https://en.wikipedia.org/wiki/MacGuffin  and  a  play  on  the 
notion  of  Null  Island  https://en.wikipedia.org/wiki/Null_Island  ,a 
fictional  island  located  at  geographic  coordinates  of  0 . 0  ,  0.0. 


I  like  to  think  of  "Null  Island"  as  something  that  exists  as  a  place  without 
necessarily  have  spatial  properties.  For  example,  you  might  know  that  a  venue  is  in 
a  particular  neighbourhood  without  knowing  its  exact  coordinates.  That  venue  might 
be  considered  to  be  "visiting  Null 

Island  https://spelunker.whosonfirst.org/nullisland/  ". 


We  use  Null  Terminal  as  a  device  to  signal  (or  contain)  places  with  uncertain 
geographies  or  locations  that  we  can't  or  won't  share  with  the  public.  For  example, 

the  entirety  of  the  permanent 

collection  https  :  /  /mil  Is  field,  sfomuseum.org/  id/ 115  9 162  82  5/  Or  all  the 
loan  objects  https : //mil Is field. sfomuseum.org/ id/1159162827/  we've 
displayed  over  the  years  or  individual  works  of  public 

art  https : //mil Is field. sfomuseum.org/ id/ 772 9 7445 7/  that  may  not  be  on 
display  at  the  moment. 


Null  Terminal  (1980) 


Null  Terminal  is  a  building  and  is  parented  by  San  Francisco  International  Airport 
(1927-03-15)  which  is  an  airport.  Its  permanent  ID  is  1159160869. 


Null  Terminal  isn't  really  a  building.  We  might  change  its  (SFO  Museum)  placetype  in  the  future. 


Null  Terminal  https  :  /  /mil  Is  field,  sfomuseum.org/ id/ 1159 160869/  itself  is 
part  of  the  SFO  campus  https  :  /  /mil  Is  field,  sfomuseum.org/ id/102527513/ 
but  good  luck  trying  to  find  it. 


What's  next 

We  have  been  building  a  lot  of  scaffolding  and  plumbing  to  identify  and  promote 
those  aspects  of  the  airport  that  are  considered  "first-class  objects"  in  our  work  and 
to  ensure  they  have  a  stable  and  permanent  (a  "warm  and  fuzzy")  place  to  live  on 
the  internet. 


What's  next  is  to  continue  folding  in,  and  publishing,  more  and  more  of  the  museum 
collection.  We've  begun  at  the  macro-level 

(maps  https : //www. aaronland. inf o/weblog/2 0 18/07/3 l/things/#maps  )  and 
narrowed  the  focussed  to  the  airport  (historic  maps  /biog/20i8/08/07/oid- 
maps/  )  and  narrowed  it  again  to  specific  architecture  all  the  way  down  to  galleries 

(this  blog 

post  https : //www. aaronland. info/weblog/ 2 018/08/2 8 /things /#whosonfirs 
t  ). 


What's  next  is  to  find  out  what  happens  when  the  rest  of  the  collection  -  remember 
SFO  Museum  is  a  museum  and  a  library  and  an  aviation  collection  with  objects, 
people  and  places  from  everywhere  in  the  world  —  gets  to  play  in  that  same 
sandbox. 
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successful  distractions 

be  the  diorama  you  want  to  take  a  selfie  with 


be  the  diorama  you  want  to 
take  a  selfie  with 


Last  week  I  had  the  pleasure  of  delivery  the  opening  keynote  for  the 

Conference  on  Mobile  Position  Awareness  Systems  and 

Solutions  https : //www.exploratorium.edu/visit/calendar/c 
ompass  ( COMPASS),  organized  by  Claire  Pillsbury,  at  the 
Exploratorium  in  San  Francisco. 


This  talk  felt  like  a  bit  of  tightrope  walk  and  1  was  very  conscious  of 
not  wanting  it  to  fall  in  to  a  series  of  cascading  tangential  rabbit 
holes,  which  would  have  been  easy.  1  made  an  effort  to  keep  things 
curt  and  direct  so  much  so  that  my  sense  of  how  long  the  delivery 


would  take  was  completely  off-base  and  the  talk  was  over  sooner 
than  I  expected. 

If  I  ever  have  the  chance  to  do  this  talk  again  there  are  a  few  places 
that  would  benefit  from  a  longer  discussion  and  I've  tried  to  update 
the  text  where  necessary.  In  the  end  people  seemed  to  enjoy  the  talk 
and  it  seems  to  have  had  its  intended  effect  to  serve  as  a  lingering 
(but  friendly)  echo  throughout  the  rest  of  the  conference  proceedings 


To  be  honest  I  was  so  concerned  with  the  pacing  of  this  talk  that 
when  the  conference  organizers  didn't  do  the  usual  introduction  with 
credentials  ("So  and  so  worked  at...")  I  completely  forgot  to  fill  in 
those  blanks  myself. 


I  think  I  got  up  on  stage  and  said  something  like:  "Hi,  my  name  is 
Aaron.  I  know  some  of  you  in  the  audience.  These  days  1  work  at  the 
SFO  Museum..."  as  if  that  explained  much  of  anything.  For  the 
benefit  of  my  future -self  or  if  you  are  reading  this  weblog  for  the  first 
time  this  is  what  1  should  have  said: 


Hi,  my  name  is  Aaron.  I  am  the  Head  of  Internet 

Typing  https : / /millsf ield . sfomuseum.org/  at  the  San 

Francisco  International  Airport 

Museum  https://www.flysfo.com/museum/  .  As  silly  as  a  title 
like  that  may  sound  ( and  I  actually  have  an  even  more  vague  and 
opaque  official  title  for  HR  purposes)  it  is  the  best  way  to  describe 
my  role  at  the  museum.  I  am  charged  with  wrangling  this  crazy  little 
thing  we  call  the  "  internet "  in  the  service  of  a  nearly  forty-year  old 
museum  with  a  permanent  collection  of  over  100,  000  objects  that 
has  produced  1,200  exhibitions  across  50  gallery  spaces.  My  job  is 
to  make  all  of  these  things  visible,  legible  and  approachable  in  an 
environment  where  few  people  expect  to  find  a  museum  (an  airport ). 
In  the  past  I  have  been  part  of  the  teams  at 
Flickr  https://www.fiickr.com/  ,  Stamen 
Design  https  :  / / stamen .  com/  and 

Mapzen  https://en.wikipedia.org/wiki/Mapzen  where  the 
common  thread  has  been  maps  and  location.  I  was  also  intimately 
involved  with  re-opening  the  Cooper  Hewitt  Smithsonian  Design 
Museum  https://www.cooperhewitt.org/  in  2014  and 
launching  the  Pen  https  :  / / aaronland .  inf  o/cooperhewitt / 
which  has  a  little  something  to  do  with  indoor  location  in  museums. 
Thank  you,  to  the  organizers,  for  inviting  me  speak  and  thank  you,  in 
the  audience,  for  coming  to  listen. 


Travellers  can  now  monitor,  live  stream  and 
call  pets  while  flying 


Sensors  that  continuously  monitor  comfort,  automated  alerts,  live  video 
streaming  and  voice  chat  remove  the  stress  of  air  travel  for  pets  and  their 
owners. 


"what  would  airports  do?" 

_ ^  ■ 

WANDERLUST:  The  technology  complies  with  IATA  regulations 


A  few  years  ago  before  I  started  working  at  SFO  I  did  a 
talk  https : / /www. aaronland. info/weblog/2  0 15/ 11/09/keinhol 
z/#mcn  where  I  argued  that  the  museum  sector  could  learn  a  lot,  at 
least  from  an  operations  perspective,  from  airports.  I  am  not  going  to 
revisit  that  discussion  today  except  to  say  that  when  it  comes  to  the 
future  of  indoor  positioning  and  locative  technologies  museums 
would  continue  to  benefit  from  looking  at  what's  going  on  in 
airports. 


These  days  I  have  daily  exposure  to  a  long  list  of  airplane  and  travel 
industry  trade  publications.  If  you  look  past  the  hyperbole  it  is 
possible  to  see  the  near-future  of  consumer-grade  location  and 
tracking  technologies  being  deployed  to  a  mass  audience. 

I  am  going  to  skip  the  laundry  list  of  examples  and  just  mention  that 
if  Apple's  "Pay  with  your  face"  ad  campaign  has  ever  left  you  feeling 
uneasy  you  might  want  to  buckle  up  because  it's  probably  going  to 
get  a  lot 

Weirder  https : / /harpers . org/ archive/ 2018/ 07 /known- 
unknowns/  before  it  doesn't. 


Airports  can  be  distinguished  from  museums  by  the  volume  of  traffic 
they  service  and  the  security  demands  placed  on  them.  This  means 
they  enjoy  larger  technolgy  budgets  and,  generally,  fewer  questions 
about  consent  so  it's  not  always  an  apples-to-apples  comparison. 


Still,  an  aiport  remains  a  useful  space  to  watch  technologies  being 
introduced  and  normalized  (or  not)  and  can  serve  as  a  guide  for 


museums. 


a  series  of  awkward 
questions 


Today  I  would  like  to  use  the  opportunity  of  standing  up  here 
delivering  the  opening  keynote  not  so  much  to  discuss  the  specifics 
of  indoor  technologies  (we  have  the  next  two  days  for  that)  but 
rather  to  use  these  technologies  that  enable  the  work  we  do  as  a  lens 
back  on  to  that  work  itself. 


I  would  like  to  ask  some  questions,  awkward  questions  even,  that  do 
not  have  ready  or  easy  answers.  I  want  to  ask  these  questions  with 
the  hope  that  even  left  unanswered  they  may  serve  as  a  useful  foil  for 
the  rest  of  the  conference. 


and  a  caution: 


I'd  also  like  to  start  with  a  caution,  something  to  keep  in  mind  as  this 
event  gets  underway. 


disney  is  to  the  museum 
sector  what  google  is  to 
the  technology  sector 


It  is  that  Disney  is  to  the  museum  sector  what  Google  is  to  the 
technology  sector. 


photo  by  aloha75 


What  I  mean  to  say  is  that  using  Disney  as  a  frame  of  reference 
when  discussing  what  is  possible  in  the  museum  space  in  2018  is  an 
unhelpful  distraction.  Saying  "But  Disney  does  it..."  has  become  the 
trickle-down  economic  theory  of  technology  development  in  the 
museum  sector. 

It  also  does  a  real  disservice  to  the  genuinely  hard  work  these 
companies  have  done  and  continue  to  accomplish.  Google  and 
Disney  exist  in  universes  of  their  own  creation,  complete  with  air 


that  we  don't  breathe,  but  it  is  important  to  recognize  that  they  have 
(mostly)  earned  it. 


It  is  important  to  recognize  that  there  is  a  network  effect  in  each  of 
their  individual  efforts  and  that  those  efforts  play  themselves  out 
over  multiple  projects  often  spanning  years  and  many  failures.  The 
museum  sector,  both  individually  and  collectively,  remains  unable  or 
unwilling  to  make  those  kinds  of  investments  or  to  take  equivalent 
risks  and  the  results  are  predictably  un-Disney-esque. 


I  wish  the  museum  sector  would  endeavour  to  create  Disney- scale 
infrastructures  and  tooling  in  service  of  cultural  heritage.  Until  it 
does  though  maybe  we  can  just  admit  that  talking  about  Disney  does 
nothing  to  address  the  needs  and  concerns  we  face  on  a  day-to-day 
basis? 


photo  by  SFO  Museum 


A  few  years  ago  my  former  colleague  Seb 

Chan  http:  //www.  sebchan.com/  and  I  were  speaking  to  a  Large 
Museum  about  their  digital  initiatives.  After  we  left  Seb  commented 
that  the  last  thing  they,  or  any  museum,  needed  was  another  in¬ 
gallery  recording  booth  for  telling  personal  stories.  Not  because  the 
stories  aren't  interesting  or  important  but  because  a  museum  is  often 
the  wrong  place  to  tell  them. 


People  may  not  be  comfortable  telling  the  stories  in  a  public  space 
or,  as  often  as  not,  people  don't  feel  a  story  they  know  is  theirs  to  tell. 


Museums  would  be  better  off,  Seb  suggested,  in  setting  up  phone 
lines  and  answering  machines  and  letting  people  call  in  at  some  later 
date. 

In  another  universe  we  would  have  built  this  and  I  hope  one  day 
someone  does  because  it's  a  genius  idea.  If  you'll  permit  me  to  gloss 
over  the  not-insubstantial  implementation  details  what  excites  me 
about  this  scenario  is  that  in  all  these  phone  calls  and  stories  I  see  a 
radio  station. 

Not  a  metaphorical  radio  station  but  an  actual  broadcast  channel  run 
and  managed  and,  yes,  curated  by  the  museum.  I  see  a  radio  station 
as  a  means  to  encourage  community  participation  and  as  a  way  to 
celebrate  and  foster  a  multipilicity  of  views  about,  and  the  nuance 
that  surrounds,  a  subject. 

I  see  it  as  a  way  for  the  museum  to  exist  meaningfully  beyond  the 
walls  of  the  building. 


Shameless  plug:  If  you  are  flying  out  of  SFO  on  your  way  home  and 
happen  to  be  flying  out  of  Terminal 

3  https : / /millsf ield . sfomuseum.org/ id/115/915/730/7/ 
there  is  an  entire  exhibition  of  radios  on 

display  https : //WWW . f lysf o . com/museum/exhibitions/ radio 
right  now.  If  you  are  flying  international  out  of  Boarding  Area 
G  https : / /millsf ield . sfomuseum.org/ id/115/915/733/7/ 
there  is  a  post-security  connector  to  Terminal  3  so  you  can  get  to  the 
exhibition  that  way  too. 


Another,  simpler,  take  on  the  idea  of  the  radio  station  can  be  seen  in 
some  of  the  work  that  Shelley 

Bernstein  https://www.shell7.nyc/about-shelley  has  done 
with  wearable  computers  at  the  Barnes 
Foundation  https  :  /  / medium .  com/bar nes - 

foundation/wearable/home  .  This  image  shows  work  they  did  to 
prototype  a  watch-based  application  that  will  show  you  stuff  nearby, 
include  some  sort  descriptive  text  and  an  image  and  offer  to  let  you 
save  that  object.  Personally,  I  don't  actually  care  about  the  images 
and  I've  never  been  convinced  that  low-cost  watch-scale  hardware 
could  show  you  more  than  a  handful  of  images  before  needing  to  be 
rebooted. 


What  I  like  about  this  project  is  that  it  points  not  just  to  the  potential 
of  a  future  idea  but  the  actual  mechanics.  These  are  sub- 100$ 
Android  watches  which  is  a  big  deal  because  it  means  that  the  cost  of 
hardware,  which  is  equal  parts  a  well-designed  and  robust  enclosure 
as  it  is  electronics  or  software,  becomes  a  realistic  line  item  in  a 
museum  budget. 

I  definitely  don't  care  that  it's  a  watch.  What  I  like  about  the  form- 
factor  though  is  that  (having  removed  the  wrist  strap)  it's  something 
you  can  carry  around  in  your  hand  or  on  a  lanyard  and  that  can  be 
operated  with  nothing  more  than  your  thumb. 


Basically  there  are  two  problems  that  you  need  to  answer  when 
you're  working  with  indoor  location  in  a  museum.  The  first  is:  Where 
am  I?  This  is  or  can  be  a  mostly  solved,  or  solved-enough  problem, 
using  techniques  like  wifi-triangulation.  The  second  more  urgent 
problem  is  always  the  same:  Now  what? 


Museums  tend  to  see  this  as  an  opportunity  to  get  in  front  of  people 
and  demand  all  of  their  attention.  What  if  instead  of  demanding  that 
visitors  constantly  context  switch  between  the  physical  and  screen- 
based  spaces  the  device  (the  watch)  simply  allowed  you  to  signal 
interest  in  or  awareness  of  an  object,  offering  a  minimal  amount  of 
feedback. 


That  a  touch-based  watch,  costing  less  than  100$,  gets  you  more  than 
half  way  there  is  important  because  there  is  still  a  non-trivial 
amount  of 

engineering  https : / /mw2015 .museumsandtheweb.com/paper/ str 
ategies-against-architecture-interactive-media-and- 
transf ormative-technology-at-cooper-hewitt/  making  that 
idea  in  to  a  tangible  reality  you  can  hand  out  to  visitors.  This  might 
sound  like  a  simplified  (and  way  cheaper)  version  of  the  Cooper 
Hewitt  Pen  https://collection.cooperhewitt.org/pen  and 
depending  on  how  you  framed  it  to  people  it  could  be. 


What  I  imagine  is  more  like  a  smushing-up  of  the  Pen  and  the 
Brooklyn  Museum's  Ask 

Brooklyn  https : / /hyperallergic . com/300564/ a-brooklyn- 
museum-app-encourages-visitors-to-ask-questions/  app  (and 
its  back-of-house  infrastructure  and  processes)  and  SFMoMA's 
location-aware  audio  guide 

application  https  :  //www.museumnext .  com/ insight/how-sfmoma- 
made-its-audio-f irst-location-aware-app/  .  Specifically 
something  that  you  carry  around  and  whether  it's  through  active  or 
passive  signaling  generates  an  "audio  guide"  or  a  "podcast"  or  a 
"radio  program"  that  you  can  listen  to  after  you've  left  the  museum. 
Something  with  a  narrative  that  you  can  listen  to  while  you  make 
dinner,  for  example. 


Dave  Patten,  from  the  Science  Museum,  discussed  their 
experimental  " Museum  Egg " 

project  https : / / lab . sciencemuseum.org . uk/ science-museum- 
hack-guest-blog-by-david-burton-f rom-redweb-team- 
museumegg-smhack-5  910d2f  a5  81e  on  the  second  day  of  the 
conference.  While  not  exactly  what  I've  just  described  it  is  close 
enough  in  spirit  and  execution  that  it's  not  a  stretch  to  imagine.  Its 
size  and  case  design  aren't  the  most  elegant  but  with  the  caveat  that 
both  miniaturization  and  enclosures  are  tasks  guaranteed  to  make 
you  sad  it  remains  an  exciting  project. 


What  both  of  these  imaginary  projects  share  is  the  goal  of  giving 
visitors  a  reason  to  come  back  to  the  museum.  These  projects 
assume  that  you  will  miss  something  the  first  time  through  an 
exhibition.  You  might  miss  something  because  you  didn't  know  to 
look,  you  couldn't  be  bothered  to  wait  or  because  your  kid  ran  in  to 
another  room  and  then  life  took  over. 


They  are  premised  on  the  idea  that  in  many  ways  the  contemporary 
museum  experience  can  be  as  stressful  and  distracting  as  it  may  be 


enlightening.  They  are  premised  on  the  idea  that  in  many  ways 
museums  are  bad  at  being  museums. 


These  projects  try  to  imagine  a  way  to  talk  about  our  collections 
with  visitors  outside  and  beyond  the  museum  rather  than  talking 
them  in  the  moment.  This  is  after  all  the  novel  change  that  the 
internet  has  made  possible. 


awkward  question  time 


The  first  question,  and  the  reason  I  started  with  all  these  stories  of 
museums  operating  outside  the  building  is  simply  this:  Do  we 
believe  that  most  people  will  ever  visit  an  exhibit,  never  mind  the 
museum  itself,  twice? 


I  recognize  that's  a  fairly  provocative  statement  but  I  think  it's 
important  to  admit  that  it's  not  a  given  anymore.  It's  especially  not  a 
given  for  museums  whose  audience  is  predominately  tourism-based. 
Some  small  number  of  those  visitors  will  grace  a  museum's  doors 
twice  but  the  rest  of  us  are  just  boxes  to  tick  on  their  selfie  check- list. 


What  does  this  say  about  our  collections  and  our  work  when  one  of 
the  central  tenets  of  cultural  heritage,  of  the  humanities  really,  is  the 
value  and  a  celebration  of  prolonged  and  repeated  consideration  of  a 
work  or  a  subject? 


Are  we  exacerbating  the  situation  with  technology?  Are  we  using  all 
the  fancy  technology  we  have  at  our  disposal  to  create  single-use 
spaces  that  will  never  be  graced  by  the  same  person  twice?  Are  these 
spaces  the  disposable  coffee  cups  of  cultural  heritage? 

Or  does  the  technology  simply  demonstrate  that  the  only  common 
language  we  have  for  discussing  our  work,  as  museums,  with  most 
visitors  is  spectacle?  That  is  a  nice  way  of  asking:  Does  anyone 
really  understand  what  museums  are  trying  to  say,  or  why? 


Do  we  celebrate  the  experiential  museum  because  it's  the  only  place 
where  most  people's  lives  and  the  language  of  scholars  intersect?  Is 
this  mediated  and  emotionalized  half¬ 
language  https : / /www. wired .com/ story/ self ie-fac tor ies- 
instagram-museum/  the  only  shared  understanding  we  have  left? 


It  would  certainly  help  explain  the  success  of  projects  like  the  Color 
Factory  or  the  Ice  Cream  Museum  or  any  of  the  other  amusement 
parks  dressed  up  as  museums  that  have  opened  their  doors  in  recent 
years.  Why  should  we  be  surprised  by  their  success  if  their  basic 
outline  is  everything  I've  described  in  the  last  two  slides?  They  are 
spectacular  one-offs  with  no  shortage  of  technology  supporting  and 
manufacturing  a  guaranteed  experience  that  typically  manifests  itself 
as  a  collection  of  selfies. 


I  find  myself  wondering  though  whether  in  these  settles,  and  other 
ephemera  like  it,  we  are  seeing  visitors  grasping  at  straws  for 
something  like  the  contemplative  moments  we  used  to  say  define 
museums.  They  are,  by  and  large,  profoundly  isolated  moments  but 
they  are  something  tangible  that  exists  beyond  the  visit  itself. 

They  afford  a  revisitation  of  things  beyond  first  impressions. 


Speaking  of  revisiting  things  I  want  to  end  this  talk  with  a  short 
discussion  about  data  collection.  Some  of  you  may  remember  that 
earlier  this  year  Mark  Zuckerberg  was  summoned  Capitol  Hill  to 
address  questions  about  Facebook's  role  in  the  2016  elections  and  the 
general  chaos  that  defines  our  lives  these  days. 

Zuckerberg's  defense  can  be  roughly  paraphrased  as  "Gosh,  golly, 
gee  whilikers.  We  had  no  idea  that  people  would  do  bad  things  with 
our  platform."  This  is  an  argument  that  is  as  laughable,  coming  from 


a  company  that  employs  as  many  clever  people  as  Facebook  does,  as 
it  is  indefensible. 


Claiming  a  deficit  of  imagination  for  how  a  technology  might  be 
abused  is  not,  or  should  no  longer  be,  an  acceptable  excuse  in 
2018. 


We  have  long  relied  on  technological  cost  and  complexity  doubling 
as  a  barrier  against  our  own  worst  impulses,  both  external  and 
internal.  We  are  living  through  a  moment  when  all  of  those 
safeguards  are  being  shredded  in  front  of  our  eyes  by  advances  in 
hardware  and  software  and  the  mountain  of  data  those  advances  are 
producing. 


The  impulse  to  market,  sell  or  otherwise  leverage  the  data  that  we  as 
museums  collect  through  location-based  technologies  will  be  strong 
even  though  this  has  nothing,  and  I  mean  literally  nothing,  to  do  with 
the  "business"  of  cultural  heritage.  The  impulse  to  develop  museum 
exhibitions  solely  for  the  purpose  of  collecting  visitor  data  will 
surely  follow,  which  assumes  someone  isn't  already  doing  it. 


On  the  second  day  of  the  conference  Jen 

King  https : / / cyber law. Stanford . edu/ about /people/ jen- 
king  ,  Director  of  Consumer  Privacy  at  Stanford's  Center  for 
Internet  and  Society  https://cyberlaw.stanford.edu/  ,  did  an 


excellent  talk  digging  deep  in  the  nuts  and  bolts  of  these  issues. 
Hopefully  she  will  post  her  slides  somewhere  and  when  she  does  it 
should  become  required  reading  for  anyone  in  the  cultural  heritage 
sector. 

All  location  or  positioning  technology  trends  towards  identification 
at  the  atomic  or  individual  level.  If  we  are  going  to  deploy  these 
technologies  far  and  wide  within  our  institutions  then  I  want  to  leave 
you  with  a  final  question.  It's  a  question  that  I  think  we  not  only  need 
to  ask  ourselves  but  that  we  actually  need  be  able  to  articulate  an 
answer  for:  Are  museums  going  to  be  more  than  the  carrot  to 
surveillance  capitalism's  stick? 


thank  you 


Today 


i  predict  outlets  of  the  yayoi  kusama 
museum  in  all  the  major  airports  /  the  first 
to  open  sandwiched  between  the  Hermes 
and  Prada  stores  in  HKG  /  following  that 
the  CIA  will  secretly  fund  the  restoration 
of  T3  at  JFK  (and  if  there  is  any  justice  all 
of  the  JFK  while  they're  at  it...)  as  a 
celebration  of  jeff  koons  /  LHR  will  finally 
build  a  third  runway  and  hire  norman 
foster  to  design  its  new  terminal  after  a 
henry  moore  sculpture  with  enough  room 
for  an  indoor  petting  zoo  for  sheep  and  its 
companion  shops  selling  stuffed  toys  for 
children  on  long  haul  flights  /  the  french 
will  then  announce  that  they’ve  had  secret 
tunnels  running  from  the  louvre  to  la 
defense  to  the  original  T1  since  the  1970s 


There  is  more  than  one  way  to  answer  any  of  the  questions  I  have 
asked  today  but  let's  at  least  try  to  be  honest  with  ourselves  and  our 
visitors  in  what  we  say. 


Thank  you. 


2018-09-06 


things  I  have  written  about 
elsewhere  #201 81 004 

More  old  maps  and  more-better  architectures 


More  old  maps  and  more- 
better  architectures 


This  was  originally  published  on  the  SFO  Museum  Mills  Field 
Weblog  https : / /millsf ield . sfomuseum. or g /blog/20 18 /10/ 04 / 
architecture/  ,  in  October  2018. 


Maps 

In  an  earlier  blog  post  introducing  aerial 

maps  blog/2018/ 08/ 07 /old-maps/  on  the  Mills  Field  website 

we  said  that  our  goal  was  "  ...to  have  more  maps  to  choose 

from  https : / /www.dezeen. com/2018/10/ 01/kerim-bayer-maps- 


atlas-istanbul-design-biennial/  than  we  can  reasonably 
expect  to  fit  in  a  standard  text-based  list."  That  day  is  today. 

Thanks  to  the  hard  work  of  the  nice  people  in  the  SFO  GIS 
Department  to 

georectify  https*  /  /  gii  •  wi  kt  ionciiry  •  oir  c[  /  wiki  /  cjooirsctificciti 
on  new-found  imagery  we've  added  aerial 

maps  https://millsfield.sfomuseum.org/map  for  the  airport 
and  surrounding  area  for  the  years 

1946  https  : / /millsf ield. sf omuseum. org/map/#1946  , 

1956  https  : / /millsf ield. sf omuseum. org/map/#1956  , 

1965  https  : / /millsf ield. sf omuseum. org/map/#1965  , 

1970  http  s : / /millsf ield. sf omuseum. org/map/#1970  , 

1972  http  s : / /millsf ield. sf omuseum. org/map/#1972  , 

1997  http  s : / /millsf ield. sf omuseum. org/map/#1997  , 

1999  https://millsfield.sfomuseum.Org/map/#1999  and 
2006  https  : / /millsf ield. sfomuseum.org/map/#2006 


.org/map/#1965/bg/1 6/37.61 49/- 1 22.3871 


♦”  1965  "+ 


We've  also  started  importing  maps  and  filling  in  the  years  from  other 
sources,  notably  the  Aerial  Photography 

Collection  http:  / /mil .  library .  ucsb .  edu/ap_indexes/FrameF 
inder/  at  the  UC  Santa  Barbara 

Library  https : //WWW. library . ucsb . edu/ s rc /col lections - 
aerial-photography  ,  like  this  map  from  1965. 


https://millsfield.sfomuseum.Org/maD/#1972/bQ/15/37.6154M22.3884 

♦"  1972  Q  "♦ 


As  well  as  maps  from  SFO  Museum's  own  collection  like  this 
image  #  from  1972.  Do  you  notice  Rotunda 
A  id/115/939/627/7/  just  to  the  right  of  the  present-day 

International  Terminal 

A  https://millsfield.sfomuseum.Org/id/115/915/733/9/  ? 
We'll  come  back  to  that  shortly. 


We've  also  started  cataloging  all  of  these  maps  the  same  way  we  do 

architecture  and  other 

features  https : / /millsf ield . sfomuseum.org/blog/ 2018/08/28/ 
whosonf  irst/  that  are  relevant  to  the  airport  and  the  museum's 
collection.  This  is  early  work  and  subject  to  change  but  every  map 
we  display  is  included  as  a  GeoJSON  file  http :  / /geo  j son .  org/ 
complete  with  a  stable  identifier,  geographic  and  temporal  extents 
and  source  information. 

https://github.com/sfomuseum-data/sfomuseum-data-maps 


The  catalog  does  not  contain  individual  map  tiles  or  aerial  imagery 
and  is  available  from  the  sfomuseum- 


data  https://github.com/sfomuseum-data/  GitHub  account. 


Branch:  master  ▼  sfomuseum-data-maps  /  data  /  136  /  039  /  134  /  7  /  1360391347.geojson  Find  file  Copy  path 

thisisaaronland  polygons  for  1970  3018962  22  days  ago 

0  contributors 


Architectures 

We  have  also,  again  with  the  help  of  the  ever-fantastic  SFO  GIS 
Department,  updated  the  so-called  "cardboard  cutout"  geometries  for 
buildings  and  terminals  that  we  released  last  summer.  Those  early 
shapes  have  been  updated  by  tracing  from  the  aerial  imagery 
described  above  and,  in  certain  cases  like  Terminal  3,  merging 
contemporary  spatial  data. 


These  were  the  building 

geometries  https  :  / /millsf  ield.  sfomuseum.org/blog/2018/ 08/ 
2  8/whosonf  irst/  first  published  in  late-August  of  2018: 


These  are  the  building 

geometries  https  :  / /millsf  ield.  sfomuseum.org/placetypes/bu 
ilding  as  of  October  2018: 


Buildings 


Mills  Field 


There  are  15  of  them  /  see  all  the  placetvpes 


w 

ato  terminal  comics  l«U4-  - 

arojcrcLaa  l_coar  lex  _UOU  -  t 
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•xt 

;-c 

\ 

-  >;n  official  wetislte  I  is  a  wmk  in  progress  iti  umlersiaio  and  lo  snare  ihe  arc  ulour  Ulrecticnas 


We  still  need  to  update  the  geometries  for  boarding  areas  as  well  as 
finesse  some  of  the  terminals  but  that  will  happen  shortly. 

Why  are  all  these  different  "versions"  of  SFO  important?  Let's 
Consider  this  photo  https  :  / /WWW.  f  lysf o .  com/museum/ aviation- 
museum-library/collection/17014  of  the  Rotunda  A 
extension  https : / /millsf ield . sfomuseum.org/ id/ 115/939/62 
7 / 7 /  to  South  Terminal,  taken  while  it  was  under  construction. 


negative:  San  Francisco  International  Airport  (SFO),  South  Terminal,  Rotunda  A  construction,  1971,  Collection  of  SFO 

Museum  2011.032.2500 


Associating  the  photo  with  this  SFO  (from  1963- 

74)  https : / /mills field . sfomuseum.org/ id/115/939/632/5/ 

might  make  sense  since  it  was  taken  in  1971 . 


SFO  Terminal  Complex  (1963-  -  1974-) 


SFO  Terminal  Complex  is  a  building  and  is  parented  by  San  Francisco  International  Airport 
( 1927-03-15)  which  is  an  airport.  Its  permanent  ID  is  1159396325. 


The  rule  of  thumb  though  for  a  "phase 

shift"  https : / /whosonf irst . org/blog/20 16/ 10/ 06/wof- 
lif ecycie-documentation/  (when  one  SFO  is  superseded  by 
another)  is  the  requirement  that  a  change  be  visible  and  tangible  to 
passengers.  Rotunda  A  wasn't  opened  until  1974  so  arguably  this 
SFO  (from  1974- 

79)  https : / /mills field . sfomuseum.org/ id/115/939/633/9/ 
is  more  correct. 


This  SFO  (before 

1963)  https : / /millsf ield . sfomuseum.org/ id/ 115/939/632/9 


/  definitely  doesn't  make  any  sense  since  no  part  of  South  Terminal 
(now  Terminal  1)  had  been  built  yet. 


Nor  does  this  SFO  (from  1979- 

81)  https : / /mills field . sfomuseum.org/ id/115/939/633/1/ 
make  any  sense  since  North  Terminal  (now  Terminal  3)  wasn't  built 
until  1979  eight  years  after  the 

photo  https : / /www. f lysf o . com/museum/ aviation-museum- 
library/collection/17014  was  taken. 


All  of  these  many  SFOs  are  important  because  they  help  to 
contextualize  things  (like  the 

photo  https : / /www. f lysfo . com/museum/ aviation-museum- 
library/collection/17014  of  Rotunda  A)  in  the  moment  but 
also  to  demonstrate  how  that  context  has  changed  over  the  years. 


We've  also  begun  to  add  other  buildings  on  the  SFO  campus,  notably 

the  original  San  Francisco  Airport  Administration 
Building  https  :  //millsf  ield .  sfomuseum.org/ id/ 136/ 066/5  0 


4/7/ 


San  Francisco  Airport  Administration  Building  (1941-  -  1954-) 


San  Francisco  Airport  Administration  Building  is  a  building  and  is  parented  by  San  Francisco 
International  Airport  (1927-03-15)  which  is  an  airport .  Its  permanent  ID  is  1360665047. 


The  Administration  Building,  long  since 

gone  https : // git hub . com/whosonf irst-data/whosonf irst- 
data/ issues/1336  ,  sat  on  what  is  now  the  Zebra  taxiway  which 
runs  alongside  the  1  OR  /  2  8L  runway  frequented  by  many  of  the 
large  wide-body  planes  flying  across  the  Atlantic  or  the  Pacific. 
These  include  the  Lufthansa  A380  that  flies  from  San  Francisco  to 
Frankfurt  https : / /WWW. sfgate.com/chris- 
mcginnis/article/2-more-Airbus-A380s-for-Calif ornia- 
12631470. php  . 


This  plane  often  takes  off  from  the  Eastern  edge  of  the  airport 
(runway  2  8R)  but  takes  the  long  way  to  get  there,  looping  over  the 
Western  end  of  the  two  runways  (10R/28L  and  1 OL/2  8R)  and 
traveling  the  length  of  the  runway  again  before  turning  around  for 
departure. 


Which  means  that  every  day  this  A3 80  lumbers  through  the  ghost  of 
the  original  San  Francisco  Airport  Administration  building.  The 
general  consensus  around  the  office  is  that  the  A3  80  is  probably 
bigger  than  the  building  itself. 


What  did  catch  my  attention  were  the  cameras  mounted  on  the  outside.  As  we  taxied  to  the 
runway,  I  started  checking  out  the  different  views,  including  one  feature  that  shows  the 
airplane  overlaid  on  a  very  detailed  satellite  map. 


In  his  2017  review  Lufthansa  Business  Class  from  San  Francisco 

to  Frankfurt  https : / / www . travelcodex . com/ review- 
luf thansa-busines s -class -from- san-franc isco-to- 

f  rankfurt/  Scott  Mackenzie  mentions  the  A380's  exterior 
cameras  saying:  "As  we  taxied  to  the  runway,  I  started  checking  out 
the  different  views,  including  one  feature  that  shows  the  airplane 
overlaid  on  a  very  detailed  satellite  map." 


It  is  fun  to  dream  up  all  the  things  we  might  build  going  forward, 
whether  it's  for  a  seat-back  screen  or  a  personal  device,  as  SFO 
Museum  releases  more  and  more  geospatially  (and  temporally) 
enabled  data  about  the  airport! 

Distributions 


Finally,  for  developers: 


SFO  Museum  sqlite  Inventory 

This  file  was  generated  by  robots  on  201a  - 10-03117 : 41  s3oz.  There  is  also  a  machine-readable  version  available  at  inventorv.ison  as  well  as  Atom  1 .0  and  RSS  2.0  feeds.  As  of  this  writing  “alt”  (or 
“alternative”)  files  are  not  included  in  any  of  the  distributions. 

•  sfomuseum-data-architecture-latest.db.bz2  5d21819c7e9fe0974952c478f416b69f3575fc1739431a5f9fe85bafdf323435 

This  distribution  contains  4 7 0  SFO  Museum  records  and  is ; .  0  mb  compressed  and  6.3  kb  uncompressed. 

It  was  created  2018-10-03117= 39  447  and  (the  data  itself)  was  last  updated  on  2018-10-03116=53=172. 

•  sfomuseum-data-architecture-1 538585597.db.bz2  5d2l8l9c7e9fe0974952c478f4i6b69f3575fcl73943ia5f9fe85baldf323435 

This  distribution  contains  470  SFO  Museum  records  and  is  :  .0  mb  compressed  and  6.3  kb  uncompressed. 

It  was  created  2018-10-03117 1 39  ;  ■ .  and  (the  data  itself)  was  last  updated  on  20 1  s  10-03TI6: 53 : 172. 

•  sfomuseum-data-architecture-1 535474688.db.bz2  52ca5ed  1 4a6520e49d90e8c551 9857f71  C06b44075a9d  1  dc3e8a94 1 1 19107570 

This  distribution  contains  295  SFO  Museum  records  and  is  -.57  kB  compressed  and  .1  kb  uncompressed. 

It  was  created  2018-09-24120=47.447  and  (the  data  itself)  was  last  updated  on  2oib-o8-28ti6:44:4bz. 

•  sfomuseum-data-publicart-latest.db.bz2  8204e0257d0202b03db73edc1f6bc3a2b8a371501b3748b3betb94caa2dd42l2 

This  distribution  contains  77  SFO  Museum  records  and  is  64  kB  compressed  and  1  rb  uncompressed. 

It  was  created  2018-09-24120=47  =472  and  (the  data  itself)  was  last  updated  on  2018-09-19123  =  24  =  522. 

.  Sf0museum-data-publicart-1537399492.db.bz2  8204e0257d0202b03db73edc1f6bc3a2b8a37150fb3748b3befb94caa2dd42t2 

This  distribution  contains  77  SFO  Museum  records  and  is  sz  rb  compressed  and  59  =  kb  uncompressed. 

It  was  created  2018-09-24120=47  -  and  (the  data  itself)  was  last  updated  on  2018-09-19x23  =  24  =  522. 

•  sfomuseum-data-whosonfirst-latest.db.bz2  C736a2c03cad8cb10880a77b6a4b4i053947lac04b7if2a669cc3fe38l06b82a 

This  distribution  contains  «  SFO  Museum  records  and  is  2 . 0  kb  compressed  and  4  mb  uncompressed. 

It  was  created  2oi8-o9-24T2o  =  47  siz  and  (the  data  itself)  was  last  updated  on  2018-08-20x21=59=282. 

•  sfomuseum-data-whosonfirst-1534802368.db.bz2  C736a2c03cad8cb10880a77b6a4b410539471ac04b71f2a669cc3le38106b82a 

This  distribution  contains  e  SFO  Museum  records  and  is  2 . 0  mb  compressed  and  mb  uncompressed. 

It  was  created  2018-09-24x20=47  5  .  and  (the  data  itself)  was  last  updated  on  2018-08-20x21=59=282. 


A  reminder  that  in  addition  to  publishing  data  on  the  sfomuseum- 
data  https://github.com/sfomuseum-data/  GitHub  account 
we  are  also  publishing 


https : //mi 11s fie Id . sfomuseum.org/distributions/ s 


qiite/  distributions  of  all  our  data  (architecture,  public  art,  places 
and  more  soon!)  on  the  Mills  Field  website. 

For  an  introduction  to  why  SQLite  databases  are  interesting  I'd 
recommend  reading  Simon  Willison's  The  interesting  ideas  in 
Datasette  https  :  / / simonwillison .  net/2  0 18 /Oct/ 4/datasette- 
ideas/#atom-every  thing  .  Datasette  is  Simon's  tools  for  ''open 
source  tool  far  exploring  and  publishing  structured  data "  in  SQLite 
databases  and  generally  full -of- good . 


•  ••  +  |  ,  m,  1 1  6  ),  |  Q.  Search _ 1  f  ||  *  ||  O  »  A 


Smart  Feeds 

■ft  Today 

sfomuseum-data-architecture-1538585597.db.bz2 

ft  All  Unread 

10:39 

SFO  Museum  sqlite  distributions 

ft  Starred 

SFO  Museum  sqlite  distributions 

sfomuseum-data-architecture-latest.db.bz2 

This  distribution  contains  470  SFO  Museum  records  a... 

sfomuseum-data- 

10:39 

SFO  Museum  sqlite  distributions 

architecture- 
1 538585597.db.bz2 

Oct  3,  2018  at  10:39 

This  distribution  contains  470  SFO 

Museum  records  and  is  1.0  MB 

compressed  and  6.3  MB 
uncompressed.  It  was  created  2018- 
10-03T17:39:46Z  and  (the  data 
itself)  was  last  updated  on  2018-10- 

03T16:53:17Z. 

0- 

There  are  also  handy 

RSS  https  : / /millsf ield. sfomuseum.org/distributions/ sqlit 
e/rss.xml  (and 

Atom  https : / /millsf ield. sfomuseum.org/distributions/ sqli 
te/atom. xml  )  feeds  that  are  updated  every  time  a  new  (SQLite) 
distribution  is  published  so  you  can  be  informed  of  changes  to  our 
data  as  they  happen. 


https  ://millsfield  .sfomuseum  .or  g/ distributions/sqlite/ 


2018-10-04 


things  I  have  written  about 
elsewhere  #201 81 01 7 

1,318  exhibitions  in  38  years 


1,318  exhibitions  in  38  years 


aaron  10:22  AM 

https://millsfield.sfomuseum.org/exhibitions/1226612623/ 


SFO  Museum 

Exhibitions  |  The  Right  Foot  Show 

The  Right  Foot  Show  is  an  exhibition  and  its  permanent  ID  is 
1226612623.  This  nonaviation  exhibition  was  on  display  between 
October  1987  and  January  1988  in  the  F-02  North  Connector  gallery, 
located  in  North  Terminal  (Boarding  Area  F) 


This  was  originally  published  on  the  SFO  Museum  Mills  Field 

Weblog  https: / /millsfield. sfomuseum.org /blog/20 18/10/17/ 
exhibitions/  ,  in  October  20 1 8 . 


Update:  After  this  blog  post  first  went  live  we  discovered  that  some 
of  the  exhibition  records  https  :  /  /  github .  com/ sf omuseum- 
data/ sf  omuseum-data-exhibition  we  published  were  in  fact 
"bunk".  Museums  and  their  databases  are  no  more  immune  from 
these  sorts  of  hiccups  than  any  other  organization.  It's  unfortunate 
but  it  happens  sometimes.  The  reason  we're  mentioning  it  here  is  that 
depending  on  when  you  read  this  blog  post  the  number  of  exhibitions 
list  on  the  "  exhibitions  by 

year"  https : //millsfield . sfomuseum.org/ exhibit ions /year 
s/  page  might  be  lower  than  the  1,318  exhibitions  mentioned  in  the 
title.  At  the  rate  we  produce  exhibitions,  though,  it  won't  be  for  long. 
Think  of  it  as  an  opportunity  to  see  our  awesome  "  this  record  has 


been 

deprecated "  https : / /mil Is field . sfomuseum.org/exhibitions/ 
1360667807/  page! 


Since  1980  SFO  Museum  has  produced  1,318  exhibitions  (an 

average  of  34  per  year)  and  today  we  are  publishing  them 

all  https://millsfield.sfomuseum.org/exhibitions/  on  our 

website  and  as  an  openly-licensed  public 

dataset  https : / / github . com/ sf omuseum-data/ sf omuseum-data- 
exhibition  .  The  first  thing  many  people  will  notice  and  rightly 
ask  is:  Where  are  all  the  objects  that  were  part  of  those  exhibitions? 
The  simple  answer  is  that  they  are  not  part  of  the  Mills  Field  website 
yet  but  will  be,  soon. 


In  this  way  the  exhibitions  data  is  similar  to  the  first  release  of  the 
(SFO)  architecture 

data  https : //mills field . sf omuseum. org/blog/20 18 /08/2 8 /who 
sonf  irst/  -  a  kind  of  "cardboard  cutout"  data  that  gives  you  the 
shape  of  the  elephant  but  not  the  details;  something  that  can  be 

improved 

On  https : / /mills field . sf omuseum. org/blog/20 18/ 10/ 04 / arch 
itecture/  as  we  did  with  the  architecture  data. 


Even  though  the  data  may  be  "shallow"  in  its  depth,  right  now,  it  is 
rich  in  breadth.  You  can  view: 


Exhibitions  by 

type  https : / /mil Is field . sfomuseum.org/exhibiti 
ons/types/ 

Exhibitions  by 

year  https : / /millsf ield . sfomuseum.org/ exhibit i 
ons/years/ 


Exhibitions  currently  on 

display  https : / /millsf ield . sfomuseum.org/ exhibi 
t ions /on-display/  —  this  page  has  handy 
RSS  https : / /millsf ield . sfomuseum.org/ exhibit i 
ons/on-display/rss . xml  and 

Atom  https : / /millsf ield. sfomuseum.org/exhibit 
ions /on-display /atom,  xml  syndication  feeds  for 
keeping  track  of  new  exhibitions 

Exhibitions  on  display  pre¬ 
security  https : / /millsf ield . sfomuseum.org/exhib 
itions/pre- security/ 


Exhibitions  on  display  post¬ 
security  https : / /millsf ield . sfomuseum.org/exhib 
it ions /post- security/ 


Exhibitions  that  have  been  shown  in  a  given  terminal, 
for  example  Terminal 

1  https : / /millsfield. sfomuseum.org/terminals/1 
159157317/exhibitions/ 

Exhibitions  that  have  been  shown  in  a  given  boarding 
area,  for  example  Boarding  Area 

A  https : //millsfield . sfomuseum.org/boardingare 
as/H59554847/exhibitions/  between  the  years 

1988  and  2000 

Exhibitions  that  have  been  shown  in  a  given  gallery,  for 
example  Gallery  B- 

03  https : / /millsfield. sfomuseum.org/galleries/ 
1 159 157 077/exhibitions/all/  , near  Gate  36 


Down-Home  Music:  The  Story  of  Arhoolie  Records 

See  all  the  exhibitions  /  or  jump  to  another  random  exhibition 


This  nonaviation  exhibition  is  on  display  between  September  14th  2018  and  June  10th  2019  in  the 
D-12  Wall  Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 


You  might  be  wondering  " Hey,  where  are  the  objects  that  were  part  of  this  exhibition,  never  mind  the  130,  000  objects  in  your  collection?"  That's  a  pretty  reasonable 
question  to  ask.  The  short  answer  is  that  they  are  over  here  on  the  llysfo.com  website 

The  longer  answer  is  that  all  those  objects  and  more  will  be  here  shortly.  This  website  is  a  work  in  progress  and  we  are  working  our  way  towards  to  the  objects  from  the 
top-down.  We  began  at  50,000  feet  (so  to  speak)  with  maps,  followed  by  the  building  and  larger  campus  and  most  recently  galleries  and  exhibitions.  We  each  step 
we  get  closer  and  closer  to  the  objects  themselves. 


This  is  a  screenshot  of  the  webpage  for  the  Down-Home  Music: 

The  Story  of  Arhoolie 

Records  https : / /millsf ield. sfomuseum.org/exhibitions/136 
0664647/  exhibition  that  opened  last  month . 

It's  not  much  yet  but  it's  something  that  people  can 
share  https : / /WWW. theatlantic . com/ technology/ archive/2015 
/ 0 1/how-to-build-the-museum-of -the-f uture/ 384646/  and  it's 
something  that  starts  to  contextualize  an  exhibition  with  the  history 
of  the 

airport  https : / /millsf ield . sfomuseum.org/ airports/ 10252  7 
513/  .  For  example,  here's  the  webpage  for  the  D-12  Wall 


Case  https : / /millsf ield . sfomuseum.org/ galleries/ 1159157 

067/  gallery  that  "Arhoolie"  (as  it's  called  around  the  office)  is  on 
display  in. 


D-12  Wall  Case  (201 7~) 

This  gallery  is  in  Boarding  Area  D,  which  is  located  in  Terminal  2  /  see  all  the  galleries 
This  gallery  supersedes  D-12  Wall  Case  (201 4~  -  2017~) 


Exhibitions  that  have  been  shown  in  this  gallery 

Down-Home  Music:  The  Story  of  Arhoolie  Records 

This  nonaviation  exhibition  is  on  display  between  September  14th  2018  and  June  10th  2019  in  the 
D-12  Wall  Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 

Maneki  Neko:  Japan’s  Beckoning  Cat 

This  nonaviation  exhibition  was  on  display  between  February  2018  and  September  2018  in  the  D-12 
Wall  Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 

The  Typewriter:  An  Innovation  in  Writing 

This  nonaviation  exhibition  was  on  display  between  May  2017  and  January  2018  in  the  D-12  Wall  Case 
gallery,  located  in  Terminal  2  (Boarding  Area  D) 


See  all  the  exhibitions  displayed  in  this  gallery  over  time 


One  of  our  hopes  with  this  website  is  to  help  people  see  all  the  work 
that  the  museum  has  done  in  the  past. 

A  person  who  visits  Terminal 

2  https : / /mills field. sfomuseum.org/terminals/1159396121 

/  for  first  time,  in  2018,  and  sees  the  Arhoolie  show  should  also  be 
able  to  see  that  this  same  gallery  has  exhibited  19th  century  hand- 
carved  beckoning 

Cats  https : / /mills field . sfomuseum.org/exhibitions/115916 
0  649/  and  ouija 

boards  https : / /millsfield. sfomuseum.org/exhibitions/12266 

197  99/  and  maps  from  the  David  Rumsey 

collection  https  :  / /millsfield.  sfomuseum.org/exhibitions/ll 
59160469/  and  any  of  the  fourteen  other 

exhibitions  https : / /millsfield . sfomuseum.org/ galleries /I 15 
9 157 0  67 /exhibitions/all/  that  have  been  shown  since  2011. 


D-12  Wall  Case  (201 7-) 


Exhibitions  that  have  been  shown  in  this  gallery 
Dojwn^Home^Musig  Jbe  Story  of  AtboalifiJBegaals 

This  nonaviation  exhibition  is  on  display  between  September  14th  2018  and  June  10th  2019  in  the 
D-12  Wall  Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 

Maneki  Neko:  Japan’s  Beckoning  Cat 

This  nonaviation  exhibition  was  on  display  between  February  2018  and  September  2018  in  the  D-12 
Wall  Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 

The  Typewriter:  An  Innovation  in  Writing 

This  nonaviation  exhibition  was  on  display  between  May  2017  and  January  2018  in  the  D-12  Wall  Case 
gallery,  located  in  Terminal  2  (Boarding  Area  D) 

The  Mysterious  Talking  Board:  Qulla  and  Beyond 

This  nonaviation  exhibition  was  on  display  between  October  2016  and  May  2017  in  the  D-12  Wall  Case 
gallery,  located  in  Terminal  2  (Boarding  Area  D) 

A  Potter’s  Life:  Marguerite  Wildenhain  at  Pond  Farm 

This  nonaviation  exhibition  was  on  display  between  December  2015  and  October  2016  in  the  D-12  Wall 
Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 

Pixar’s  Toy  Story 

This  nonaviation  exhibition  was  on  display  between  November  2015  and  May  2016  in  the  A-01 
International  South  Wall  and  G-01  International  North  Wall  and  D-12  Wall  Case  galleries,  located  in 
International  Terminal 

Classic  Monsters:  The  Kirk  Hammett  Collection 

This  nonaviation  exhibition  was  on  display  between  May  2015  and  November  2015  in  the  D-12  Wall 
Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 

Studio  Glass,  The  Art  of  Marvin  Lipofsky,  Richard  Marquis,  John  Lewis  and  Elin  Christopherson 

This  nonaviation  exhibition  was  on  display  between  October  2014  and  May  2015  in  the  D-12  Wall  Case 
gallery,  located  in  Terminal  2  (Boarding  Area  D) 

San  Francisco.  From  the  David  Rumsey  Map  Collection 

This  nonaviation  exhibition  was  on  display  between  December  2013  and  October  2014  in  the  D-12  Wall 
Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 

Nissei  Baseball 

This  nonaviation  exhibition  was  on  display  between  December  2013  and  August  2014  in  the  D-12  Wall 
Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 

From  Ship  to  Shore:  Nautical  Arts  from  the  San  Francisco  Maritime  National  Historic  Park 

This  nonaviation  exhibition  was  on  display  between  May  2013  and  December  2013  in  the  D-12  Wall 
Case  gallery,  located  in  Terminal  2  (Boarding  Area  D) 

Let’s  Play!  1 00  Years  of  Board  Games 

This  nonaviation  exhibition  was  on  display  between  October  2012  and  May  2013  in  the  D-12  Wall  Case 
gallery,  located  in  Terminal  2  (Boarding  Area  D) 


1  2 


The  D-12  gallery  is  one  of  two  galleries  that  most  people  associate 
with  SFO  Museum.  The  other  is  the  F-02  North 
Connector  https : //millsf ield . sfomuseum.org/ galleries/115 
9157053/  in  Terminal  3  which  has  shown  79 

exhibitions  https : / /millsf ield . sfomuseum.org/ galleries /I 15 
9157053/exhibitions/all/  since  opening  in 
1981  http  s : //millsf ield . sfomuseum.org/ galleries/ 13605 16 
135/  .  "F-02"  is  so  much  associated  with  SFO  Museum  that  many 
people  and  some  websites,  like 

OpenStreetMap  https : / /www. openstreetmap.org/node/517232 
9024#map=i8/37 . 61923/-122 . 38720  think  it  is  the  museum.  But 
79  exhibitions  is  less  than  10%  of  all  the 

exhibitions  https : / /millsf ield . sfomuseum.org/exhibitions 
/years/  we've  ever  produced! 


We've  said  this  before  but  it's  worth  saying  again:  In  many  ways  the 

airport  itself  is  the 

museum  https://millsfield.sfomuseum.org/galleries/  and 
our  goal  in  publishing  all  the  exhibitions,  in  all  our  work,  is  to  help 
people  start  to  see  that. 


y  OpenStreetMap  Edit  -  History  Export 


GPS  Traces  User  Diaries  Copyright  Help  About  Log  In  Sign  Up 


Search  '-'JSQ 

Node:  SFO  Museum  (5172329024) 

Created  2  atms,  2  bars,  and  23  other  objects; 

Updated  3  fast_foods  and  a  gift  shop 
Edited  12  months  ago  by  Omnific 
Version  #1  ■  Changeset  053000413 
Location:  37.6192338.  -122.3872059 

Tags 

name  SFO  Museum 

tourism  museum 


Download  XML  •  View  History 


Boarding 
Area  E 


In  addition  to  adding  exhibitions,  we've  enabled  rudimentary 
Search  https : / /millsf ield . sfomuseum.org/ search 
functionality  for  the  website.  When  we  say  "rudimentary"  though  we 
mean  it.  It's  still  pretty  easy  to  trip  up  still  but  it's  a  start.  We  don't 
have  any  results  for 

"kittens"  https : //millsf ield. sfomuseum.org/ search/? 
q=kittens  but  we  do  have  results  for  queries  like 
"cats"  https : / /millsf ield . sfomuseum.org/ search/ ?q=cats 
or  "boeing"  https://millsfield.sfomuseum.org/search/? 
q=boeing  or  "mills 

field"  https : / /millsf ield . sfomuseum.org/ search/ ? 
q=miiis+f ield  or  even 


"football"  https : / /mills field . sfomuseum.org/ search/ ? 
q=football 


The  Nation’s  Game:  A  History  of  the  National  Football 
League 

See  all  the  exhibitions  /  or  jump  to  another  random  exhibition 


This  nonaviation  exhibition  was  on  display  between  September  2015  and  March 
2016  in  the  F-02  North  Connector  gallery,  located  in  Terminal  3  (Boarding  Area 
F) 


Three  random  photos  of  this  exhibition 

See  all  88  photos  /  or  view  a  random  photo 


Photo  by  SFO  Museum 


Photo  by  SFO  Museum 


Photo  by  SFO  Museum 


Careful  readers  may  notice  that  the  screenshot  above  and  the  page 
for  the  2015-16  exhibition  The  Nation’s  Game:  A  History  of  the 
National  Football 

League  https : //mil Is field . sfomuseum.org/ exhibitions/ 1 159 
16  0617/  (as  of  this  writing)  are  not  quite  the  same.  Think  of  it  as  a 
teaser  https : //mil Is field . sfomuseum.org/blog/ 2018/07/18/ 
iiif  /  for  what  comes  next. 


The  goal  of  this  website  is  to  share  the  arc  of  our 
direction  https : / /mills field . sfomuseum.org/blog/2018/ 07/11 
/welcome/  and  so  our  preference  is  to  push  a  lot  of  small  releases 
and  improvements  out  the  door  as  we  work  on  them  rather  than  long 
silences  punctured  by  so-called  "big  reveals".  The  exhibitions  data 
and  search,  both,  will  improve  over  time  but  whereas  neither  existed 
yesterday  these  little  "somethings"  feel  like  an  improvement  over 
"nothing". 


For  a  good  discussion  of  search  and  museum  websites  I'd 
recommend  Sam  Brenner's  Reconsidering  searching  and  browsing 
on  the  Cooper  Hewitt’s  Collections 

website  https : / /mw2015 .museumsandtheweb.com/paper/ recons i 
dering- searching-and-browsing-on-the-cooper-hewitts- 
coiiect ions -website/  and  Nate  Solas'  Hiding  Our  Collections 
in  Plain  Site:  Interface  Strategies  for 

" F indability"  https  :  / /WWW. museumsandtheweb.com/mw2010/pape 
rs /solas /solas  .html  both  delivered  at  the  Museums  and  the 


Web  https://www.museweb.net/  conference  in  201 5  and  2010 
respectively. 


The  next  and  final  section  in  this  blog  post  is  for  developers  so  if 
you're  not  interested  in  developer-y  things  you  can  stop  here.  You 
might  be  interested  instead  in  the  random  exhibition 

link  https : //mil Is fie Id . sfomuseum.org/exhibitions/ random 

we  created  which  allows  you  to  dive  in,  at  some  random  point  in 
time,  and  see  all  the  stuff  we've  done  in  the  last  38  years. 

Enjoy!  https : //millsf ield . sfomuseum.org/ exhibitions /rando 
m/ 


•  •  •  1  + 

Smart  Feeds 
Today 

$  All  Unread 

4^  Starred 
On  My  Mac 


•  Aviation  Resolutions:  The  Jim  Lund  1:72  Scale  Model 
Aircraft  Photographs  (2017-09-20  to  2019-03-11) 

13=51 

Exhibitions  on  display  at  SFO  Museum 

Exhibitions  on  display  at  SFO  Museum 

•  Kirk  Crippens  Gretchen  LeMaistre:  Live  Burls 
(2018-09-11  to  2018-12-11) 

13=51 

Exhibitions  on  display  at  SFO  Museum 

Down-Home  Music:  The  Story  of  Arhoolie  Records 
(2018-09-14  to  2019-06-10) 

13=51 

Exhibitions  on  display  at  SFO  Museum 

Noritaka  Minami: 
California  City, 

California  (2018- 
09-13  to  2018-12- 
13) 

•  Xiaoxiao  Xu  |  Aeronautics  in  the  Backyard 
(2018-08-14  to  2018-11-06) 

Oct  16,  2018  at  13:51 

13=51 

Exhibitions  on  display  at  SFO  Museum 

This  photography  exhibition  is  on  display 
between  September  13th  2018  and 

December  13th  2018  in  the  F-11 

Photographs  gallery,  located  in  Terminal  3 

Noritaka  Minami:  California  City,  California 
(2018-09-13  to  2018-12-13) 

13=51 

Exhibitions  on  display  at  SFO  Museum 

•  A  Sterling  Renaissance,  British  Silver  Design 

1957-2018  (2018-10-05  to  2019-05-17) 

13=51 

F.Mhitinn.  nn  rti.ntav  at  SFO  Muslim  - 

As  with  all  our  data  releases  we've  posted  the  exhibition  data, 

modeled  as  Who's  On  First 

documents  https : / /millsfield. sfomuseum.org/blog/2018/ 08/ 
2  8/whosonf  irst/  (exhibitions  are  considered  to  be 
"installations"  https  :  / / github.com/whosonfirst/whosonfirst- 
placetypes /blob/master/ placetypes/ installation . j  son  in 

the  nomenclature  of  Who's  On  First 

placetypes  https : / / github.com/whosonfirst/whosonfirst- 
placetypes#here-is-a-pretty-picture  )  and  with  pointers  to 
their  relevant  historical  gallery  and  architectural  features,  on  the 

sfomuseum-data  GitHub 

account  https://github.com/sfomuseum-data/  : 

https://github.com/sfomuseum-data/sfomuseum-data-exhibition 

This  data  is  the  same  data  that  is  used  by  this 

Website  https://millsfield.sfomuseum.org/collection  .This 
data  is  not  the  museum's  "source  of  truth"  but  a  derivative  product 
that  is  created  from  our  internal  collections  management  system,  a 
workflow  that  we'll  talk  about  in  a  separate  blog  post.  It  is  important 
to  us  that  we  use  the  same  data  we're  asking  other  people  to  use  for 
our  own  purposes  to  help  prove  (or  disprove)  that  it  is  both  robust 
and  flexible  enough  to  do  something  interesting  with. 


SFO  Terminal  Complex  (1983~  -  1988~) 

This  building  supersedes  SFO  Terminal  Complex  (1981~  -  1983~)  and  is  also  superseded  by  SFO  Terminal  Complex  ( 1988 -  •  2000~ ) 


Terminals  Boarding  Areas  Galleries  Exhibitions 


Exhibitions  that  have  been  on  display  in  this  building 


Turn  of  the  Century  French  Posters  from  the  Collection  of  Peter  Drevfuss 

This  nonaviation  exhibition  was  on  display  between  November  1988  and  December  1988  in  the  C-05 
Terminal  One  Lower  Gallery  gallery,  located  in  South  Terminal 

Doors 

This  nonaviation  exhibition  was  on  display  between  August  1988  and  October  1988  in  the  C-05 
Terminal  One  Lower  Gallery  gallery,  located  in  South  Terminal 

Carrot  Air  Force  bv  Tom  Foolery 

This  nonaviation  exhibition  was  on  display  between  November  1987  and  December  1987  in  the  A-06 
Gate  08  gallery,  located  in  South  Terminal 


It  is  important  to  us  to  prove  that  it  is  possible  and  manageable  to 
"recast"  the  same  data  in  any  number  of  databases  or  application 
frameworks  and  still  produce  meaningful  results.  This  is  as  much 
about  trying  to  produce  something  of  quality  for  others  to  consume 
as  it  is  about  long-term  sustainability  of  our  own  efforts. 

This  work  is  meant  to  test  these  ideas. 

2018-10-17 


things  I  have  written  about 
elsewhere  #201 81 030 

SFO  Museum,  Who's  On  First  and  Airports 


SFO  Museum,  Who's  On 
First  and  Airports 


Oakland  International  Airport 

This  airport  in  United  States  is  also  referred  to  as  OAK  or  sometimes  KOAK  /  See  all  the  airports  /  or  a  random  airport  /  or  just  SFO 


Leaflet  |  ©  OSM  contributors  |  Nextzen  |  Who's  On  First  |  SFO  Museum 


This  was  originally  published  on  the  SFO  Museum  Mills  Field 
Weblog  https : / /mills field . sfomuseum. org/blog/20 18/10/30/ 
airports/  ,  in  October  20 1 8 . 

There  are  between  4,000  to  5,000  medium  to  large 

airports  https : / / spelunker.whosonfirst.org/ placetypes/ca 
mpus/  in  the  world,  in  2018.  As  of  this  writing  about  300  are 
relevant  -  are  "holding  hands"  -  with  the  SFO  Museum 
collection  https  :  / /www.  f  lysf o .  com/museum/ aviation-museum- 
library/collection  .  As  of  today  those  (300)  airports  and  the 
countries  they  belong  to  have  a  home  on  the  Mills  Field 
Website  https://millsfield.sfomuseum.org/collection  . 


https://millsfield.sfomuseum.orgl airports!  https :  /  /m 
illsf ield. sf omuseum. org/ airports/ 


•  https://millsfield.sfomuseum.org/countries/  https:// 

mills field . sfomuseum.org/countries/ 


Like  the  exhibition 

Webpages  https : / /millsfield. sfomuseum.org/exhibitions/ 

that  we  discussed  in  the  last  blog 

post  https : / /millsfield . sf omuseum. or g/blog/ 2  018/ 10/ 17/exh 
ibitions/  there  isn't  much  "stuff"  to  see  yet,  save  a  small  amount 
of  metadata  for  each  location  and  a 

map  https : / /millsfield . sfomuseum.org/blog/2018/ 07/3 1/m 
aps/  .  There  will  be,  soon,  as  these  are  all  airports  that  have  a 
relationship  with  objects  in  our 

collection  https  :  / /www.  f  lysfo .  com/museum/ aviation-museum- 
library/collection  . 


Airports 

These  are  airports  that  hold  hands  with  our  collection  /  Jump  to  a  random  airport  /  or  just  SFO 


Los  Angeles  International  Airport 

This  airport  in  United  States  is  also  referred  to  as  LAX  or  sometimes  KLAX 
Paris-Le  Bouraet  Airport 

This  airport  in  France  is  also  referred  to  as  LBG  or  sometimes  LFPB 
LaGuardia  Airport 

This  airport  in  United  States  is  also  referred  to  as  LGA  or  sometimes  KLGA 
London  Gatwick  Airport 

This  airport  in  United  Kingdom  is  also  referred  to  as  LGW  or  sometimes  EGKK 
London  Heathrow  Airport 

This  airport  in  United  Kingdom  is  also  referred  to  as  LHR  or  sometimes  EGLL 
Lihue  Airport 

This  airport  in  United  States  is  also  referred  to  as  LIH  or  sometimes  PHLI 
«j9.rge-£hayfiz.ALrRQrt 

This  airport  in  Peru  is  also  referred  to  as  LIM  or  sometimes  SPIM 
Adams  Field  Airport 

This  airport  in  United  States  is  also  referred  to  as  LIT  or  sometimes  KLIT 

Lanai  Airport 

This  airport  in  United  States  is  also  referred  to  as  LNY  or  sometimes  PHNY 

Cincinnati  MunicjpaLAJrnort^unken_Field_a925) 

This  airport  in  United  States  is  also  referred  to  as  LUK  or  sometimes  KLUK 

Luxembourg  Airport  (193x) 

This  airport  in  Luxembourg  is  also  referred  to  as  LUX  or  sometimes  ELLX 
Lyon  -  Saint  Exupery  Airport 

This  airport  in  France  is  also  referred  to  as  LYS  or  sometimes  LFLL 


1  ...  8  9  10  12  13  14  ...  20 


You  can  reference  individual  airports  by  their  unique  Who's  On 
First  https://whosonfirst.org/  (WOF)  ID  as  well  as  their 
IATA  or  ICAO  codes.  For  example,  both  of  these  URLs  will  take 
you  to  the  Mills  Field  webpage  for  the  Pierre  Elliott  Trudeau 
International 

Airport  https : / /mills field . sf omuseum. or g /airports/ 10255 
4  351/  ,  in  Montreal: 

•  https://millsfield.sfomuseum.org/ airports! 1 02554351/ 

https : //mil Is field . sfomuseum.org/ airports/ 102 
554351/ 

•  https://millsfield.sfomuseum.org/airports/YUL/  https 
: / /millsfield. sfomuseum.org/airports/YUL/ 


The  same  is  true  of  countries,  which  can  be  addressed  by  their  WOF 
ID  or  their  ISO  country  code. 


Singapore 

See  all  the  countries  (that  hold  hands  with  SFO  Museum) 


Airports  that  are  in  this  country 

These  are  not  all  the  airports  in  Singapore  but  they  are  airports  that  hold  hands  with  SFO  Museum 


Kallanq  Airport  (1937-06-12  to  1955) 

This  airport  in  Singapore  was  also  referred  to  as  2890  or  sometimes  2890 

Paya  Lebar  Airport 

This  airport  in  Singapore  is  also  referred  to  as  QPG  or  sometimes  WSAP 
Singapore  Chanai  Airport 

This  airport  in  Singapore  is  also  referred  to  as  SIN  or  sometimes  WSSS 


For  example,  both  of  these  URLs  will  take  you  to  the  Mills  Field 
webpage  for 


Singapore  https : / /mills field . sfomuseum.org/ countries/ SG 
/  : 


•  https://millsfield.sfomuseum.org/countries/85632605/ 

https : / /millsf ield. sfomuseum.org/countries/85 
632605/ 

•  https://millsfield.sfomuseum.org/countries/SG/  https 
: / /millsfield. sfomuseum.org/countries/SG/ 


Did  you  notice  that  you  can  reference  airports  and  countries  by  their 
Who's  On  First  https  : //whosonf irst . org/  ID? 


Vancouver  International  Airport  (1968) 

This  airport  in  Canada  is  also  referred  to  as  YVR  or  sometimes  CYVR  /  See  all  the  airports  /  or  a  random  airport  /  or  just 
SFO 


https : //millsf ield . sfomuseum.org/airports/ 102555307/ 

Instead  of  minting  on  our  records  and  identifiers  for  common  places 
we  are  marrying  our  data  (buildings,  terminals,  exhibitions)  with  the 
pre-existing  and  continually  https:  //github.com/whosonf irst- 
data/whosonf irst-data/pull/1361 

updated  https : / / github.com/whosonfirst-data/whosonfirst- 


data /commits /master  Who's  On  First  dataset  for  administrative 
data  (airports,  cities,  countries).  We've  submitted  our  own 
Contributions  https :  /  / github .  com/whosonf irst- 
data/whosonf  irst-data/pull/1345  ,  back  to  the  WOF  project, 
adding  missing  or  historical 

airports  https : / / github . com/whosonf irst-data/whosonf irst- 
data/ pull  / 1354  where  necessary. 


Places  that  have  been  updated  recently 

There  are  8  places  that  have  been  updated  in  the  last  7  days 


(’query1  ('filtered':  (filter*  l  and':  [{'bool  (’musLnot’:  [(’exists1  {'field  :  edtf  oeprecated'))]||)).  0.0100870132446  seconds  J 

■query’:  ('range':  {'woftlastmodified':  I'gte':  1540234975, 

•Ite’:  1540839775)1111. 

'sort':  [{'woMastmodified':  ['mode':  max',  order':  'desc'})]) 


Would  you  like  try  this  query  again  to  include  records  that  have  been  deprecated  ? 
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the  map  is  currently  centered  at  15.623037,  -4.570313  #2 


1 360695643  Kemayoran  Airport  —  this  is  a  campus  in  Indonesia 
it  is  tagged  airport  it  was  last  modified  on  October  25 

1360695653  Mills  Field  Municipal  Airport  of  San  Francisco  -  this  is  a  campus  in 

United  States 

it  is  tagged  airport  this  record  has  been  superseded 

it  was  last  modified  on  October  25 

1 360695655  San  Francisco  Airport  —  this  is  a  campus  in  United  states 
it  is  tagged  airport  this  record  has  been  superseded 

it  was  last  modified  on  October  25 

1 36069565 1  Berlin  Brandenburg  Willy  Brandt  Airport  —  this  is  a  campus  in  Germany 

it  is  tagged  airport  it  was  last  modified  on  October  25 


filter  by  concordance 

Wikidata  6  International  Civil  Aviation  Organization  5 
International  Air  Transport  Association  4 


filter  by  geometry 

point  8 


filter  by  is  'current'  -ness 

not  current  6  unknown  i  current  i 


That  way  when  our  respective  projects  talk  about  the  Albuquerque 
International 

Airport  https : / /millsf ield . sfomuseum.org/ airports/ 10252 
9173/  (IATA  code  ABQ  or  WOF  ID  102529173)  we  can  be 
confident  that  we  are  talking  about  the  same 
thing  https://spelunker.whosonfirst.org/id/102529173/ 


Albuquerque  International  Airport 

This  airport  in  United  States  is  also  referred  to  as  ABQ  or  sometimes  KABQ  /  See  all  the  airports  /  or  a  random  airport  /  or  just  SFO 


Does  the  shape  of  this  airport  look  a  bit  weird?  That's  probably  because  the  geometry  was  derived  from  the  outline  of  geotagged  Flickr 
photos  and  a  better,  openly  licensed,  alternative  hasn't  been  found  yet. 


Rather  than  importing  all  2.3  million  administrative 
records  https : / / github . com/whosonf irst-data/whosonf irst- 
data/  from  Who's  On  First  we  have  cloned  only  those  records 
immediately  relevant  to  the  SFO  Museum  collection  and  the  Mills 


Field  website  in  to  our  own  sf  omuseum-data-whosonf  irst 
dataset: 

https :  //github  .com/ sfomuseum-  data/ sfomuseum-data- 

whosonfirst 


We  use  the  go-whosonfirst- 

fetch  https : / / github . com/whosonf irst/ go-whosonf irst- 
f  etch  command-line  tool  to  keep  copies  of  the  WOF  data  we've 
cloned  in  sync  with  the  source  data.  Like  this: 


$>  . /bin/wof-f etch 

-reader  ' reader=github  repo=whosonf irst-data '  \ 

-reader  ' reader=github  repo=whosonf irst-data-postalcode-us '  \ 

-writer  'writer=repo  root=/usr/local/data/sf omuseum-data-whosonf irst '  \ 
-belongs-to  country  \ 

-mode  repo  \ 

/usr/local/data/ sf omuseum-data-whosonf irst 


This  blog  post  is  not  the  place  for  a  detailed  discussion  of  the  go- 
whosonf  irst-f  etch  tool  but  briefly  this  is  what's  happening: 


•  We're  defining  two  sources  to  read  data  from:  The  WOF 
GitHub  repositories  for  administrative 
data  https : / / github . com/whosonf irst- 
data/whosonf irst-data/  and  US 
postalcodes  https  :  /  / github.com/whosonfirst- 
data/whosonf irst-data-postalcode-us/ 


We're  defining  a  source  to  write  data  to:  A  local 
checkout  of  the  sfomuseum-data-whoson  first 
repository. 

We're  asking  the  code  to  also  fetch  the  country  record 
for  each  matching  record  we're  cloning;  that's  what  the 
-belongs-to  flag  is  for. 

We're  reading  the  list  of  records  to  clone  from  the  same 
local  checkout  of  the  sfomuseum-data- 
whoson  first  repository  that  we  are  writing  data  to; 
that's  what  the  -mode  flag  is  for. 


South  Africa 

See  all  the  countries  (that  hold  hands  with  SFO  Museum) 


Airports  that  are  in  this  country 

These  are  not  all  the  airports  in  South  Africa  but  they  are  airports  that  hold  hands  with  SFO  Museum 


Bram  Fischer  International  Airport  fuuuu) 

This  airport  in  South  Africa  is  also  referred  to  as  BFN  or  sometimes  FABL 
Cape  Town  International  Airport  (1954) 

This  airport  in  South  Africa  is  also  referred  to  as  CPT  or  sometimes  FACT 
OR  Tambo  International  Airport  f 19521 

This  airport  in  South  Africa  is  also  referred  to  as  JNB  or  sometimes  FAOR 


The  sfomuseum-data-whosonf  irst  repository  also  has  a 
separate  properties  folder  where  we  define  additional  SFO 
Museum-specific  properties  that  we  want  to 

append  https : / / github . com/ sf omuseum-data/ sfomuseum-data- 
whosonf  irst  /blob/master  /utils  /python/merge-properties  to 

the  source  WOF  data.  For  example,  these  are  the  SFO  Museum 
properties  https : / / github . com/ sf omuseum-data/ sfomuseum- 
data- 

whosonf  ir st /blob /mas ter /properties/ 102/529/ 173/10252 9173 
.  j  son  for  Albuquerque  International  Airport: 


" sf omuseum: airport_id" :  1, 

" sf omuseum: name " :  "Albuquerque  International  Sunport", 
"wof : concordances " :  { 

"icaoicode":  "KABQ", 

" iata : code " :  " ABQ" 

}, 

" sf omuseum: placetype" :  "airport", 

" sf omuseum: is  sfo" :  1 


Some  of  those  properties  might  already  exist  in  the  source  WOF  data 
(like  concordances)  but  the  important  point  here  is  that  we  have  a 
way  to  stay  current  with  updates  to  data  in  Who's  On  First  while  still 
preserving  SFO  Museum's  tweaks  to  and  opinions  about  that  data. 


This  feels  like  a  small  but  meaningful  step  in  how  our  open  data 
might  play  nicely  with  other  open  data  projects,  both  conceptually 
and  at  the  level  of  brass-tacks  implementation  details. 


Paris  Charles  de  Gaulle  Airport 

This  airport  in  France  is  also  referred  to  as  CDG  or  sometimes  LFPG  /  See  all  the  airports  /  or  a  random  airport  /  or  just  SFO 


Does  the  shape  of  this  airport  look  a  bit  weird?  That's  probably  because  the  geometry  was  derived  from  the  outline  of  geotagged  Flickr 
photos  and  a  better ;  openly  licensed,  alternative  hasn't  been  found  yet. 


In  closing,  we've  added  a  random  airport 

link  https : / /mil Is fie Id . sfomuseum.org/ airports/ random/ 

so  you  can  dive  in  with  both  feet  and  see  where  the  computer-robots 
take  you.  Enjoy! 


https://millsfield.sfomuseum.org/airports/random/ 


2018-10-30 
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Sweet  spots  between  the  extremes 


Sweet  spots  between  the 
extremes 


time  converter:  Philippine  Air  Lines,  Collection  of  SFO  Museum,  2002.011 .043 


This  was  originally  published  on  the  SFO  Museum  Mills  Field 
weblog  https : //mil Is field . sf omuseum. org/blog/20 18 /I 1 /07 /raster 
zen/  ,  in  November  2018. 


This  is  a  technical  blog  post  about  map  tiles,  caching,  third-party  services,  so- 
called  "serverless"  computing  and  sustainability.  It's  also  about  improvements 


to  open-source  software  for  managing  all  of  that  stuff. 


If  you've  been  following  this 

weblog  https://miiisfieid.sfomuseum.org/biog  recently  you've  seen 
that  as  we  add  more  and  more 

elements  https : //millsf ield . sfomuseum. or g /blog /20 18/10 /17 /exhibi 
tions/  related  to  the  collection  we've  also  been  adding  more  and  more 
places  https : //millsf ield . sfomuseum. org/blog/2018/10/30/airport 
s/  at  the  same  time.  Places  out  there  in  the  real  world  where  people  live 
and  go  about  their  lives.  And  since  every  place  demands  a 
map  https : //millsf ield . sfomuseum. org/blog/2018 /08/07 /old- 
maps/  that  means  we're  adding  even  more  and  more.,  and  still  more  map 
tiles  to  the  Mills  Field  https://millsfield.sfomuseum.org/  website. 


In  the  Maps  (and  map  tiles)  at  SFO 

Museum  https:  //millsfield.  sfomuseum. org/blog/2018/07/31/maps 
/  blog  post  we  discussed  how  we're  using  a  combination  of  the  openly- 
licensed  Nextzen  https://www.nextzen.org/  map  tiles,  custom 
rendering  software  https: //github.com/whosonfirst/go-rasterzen 
and  Amazon  Web  Services'  (AWS)  infrastructure  to  produce  and  host  the 
maps  you  see  on  this  site. 


We  make  a  point  of  caching  the  custom  tiles  we  derive  from  the  source 
Nextzen  data  as  they  are  requested.  Fresh  tiles,  though,  can  sometimes  take  a 
while  to  render  particularly  in  dense  areas  like  big  cities  with  a  lot  of  map 
data  to  process.  A  lot  of  airports  are  near  big  cities  and  since  we  care  a  lot 
about  airports  https://millsfield.sfomuseum.org/airports  this  is 
not  an  abstract  problem  for  us. 


airmail  flight  cover:  Pan  American  World  Airways,  Malaysia,  Collection  ofSFO  Museum,  2012.149.2336  a  b 


To  address  this  issue  we've  added  a  new  command-line  tool  to  the  go- 
rasterzen  https://github.com/whosonfirst/go-rasterzen  code  that 
allows  us  to  pre-seed  all  of  the  tiles,  at  all  of  the  different  zoom  levels, 
needed  to  show  a  place  on  the  map. 

We've  named  this  tool  rasterzen-seed  and  it  can  be  configured  to 
process  tiles  locally  or  to  invoke  an  instance  of  the  rasterd  tool  configured 

as  an  AWS  Lambda 

function  https://en.wikipedia.org/wiki/AWS_Lambda  allowing  all  the 
heavy  lifting  to  happen  on  a  computer  in  Amazon's  data  centers. 


Here  is  how  you  would  run  the  tool  to  pre -render  all  the  tiles  between  zoom 
levels  2  and  16,  locally  on  your  own  computer,  for  the  bounding  box 
containing  the  larger  SFO 

Campus  https://millsfield.sfomuseum.org/airports/SFO/  Storing 
the  output  in  a  folder  named  cache: 


$>  . /bin/rasterzen-seed  -mode  extent  -extent  '-122.405228  37.604481  -122.355044  37.645194'  \ 
-min-zoom  2  -max-zoom  16  -nextzen-apikey  {NEXTZEN_APIKEY}  \ 

-fs-cache  -fs-root  cache  \ 

-seed-worker  local  -seed-png  \ 

-timings 

15:50:58.758870  [ rasterzen-seed]  STATUS  Time  to  seed  tile  (16/10489/25361)  948.46835ms 
15:50:58.802463  [rasterzen-seed]  STATUS  Time  to  seed  tile  (16/10488/25364)  1.600181332s 
. . .  and  so  on 


And  here's  how  you  would  do  the  same  thing  but  off-loading  all  of  the  work 
to  an  AWS  Lambda  function  https  :  //github .  com/whosonf  irst/go- 
rasterzen#lambda  : 


$>  ./bin/rasterzen-seed  -mode  extent  -extent  '-122.405228  37.604481  -122.355044  37.645194'  \ 
-min-zoom  2  -max-zoom  16  -nextzen-apikey  {NEXTZEN_APIKEY}  \ 

-seed-worker  lambda  -lambda-function  {LAMBDA_FUNCTION}  \ 

-lambda-dsn  ' credent ials={AWS_CREDENTIALS}  region={AWS_REGION} '  \ 

-fs-cache  -fs-root  cache  \ 

-seed-png  \ 

-timings 

15:50:58.987440  [rasterzen-seed]  STATUS  Time  to  seed  tile  (16/10484/25364)  4.284314644s 

15:50:59.010505  [rasterzen-seed]  STATUS  Time  to  seed  tile  (15/5246/12684)  4.306041647s 

15:50:59.010518  [rasterzen-seed]  STATUS  Time  to  seed  tile  (15/5242/12685)  1.222798064s 

. . .  and  so  on 


Going  forward  there  are  plans  to  integrate  Amazon's  queuing 
Service  https://github.eom/whosonfirst/go-rasterzen/issues/7  in 
to  the  workflow  so  that  even  more  of  the  process  can  be  handled 
independently  of  local  computing  resources. 


The  rasterzen-seed  tool  is  configured  to  store  tiles  in  variety  in  cache 
sources  but  a  cache  source  is  not  required.  If  you  don't  specify  any  caching 
sources  as  we  did  in  the  example  above  then  tiles  will  be  rendered  and 
cached  (or  not)  depending  on  how  your  Lambda  function  is 
Configured  https://github.eom/whosonfirst/go-rasterzen#lambda  , 
typically  to  an  (AWS)  S3 

bucket  https://en.wikipedia.org/wiki/AWS_S3 


negative:  San  Francisco  International  Airport  (SFO),  noise  abatement  equipment,  Collection  ofSFO  Museum,  2011.0322445 


You  might  do  this  if  you  want  to  serve  PNG  tiles  but  don't  want  to  suffer  the 
sometimes  very  long  wait  times  generating  raster  tiles  in  production  or  you 
don't  want  to  deal  with  the  hoop-jumping  around  AWS  Lambda  /  API 


Gateway  integrations,  particularly  when  it  involves 

images  https : //github . com/whosonf irst/go-rasterzen#lambda-api- 
gateway-and-images  .  Instead  you  could  use  raster zen-seed  to  pre¬ 
render  all  your  tiles  ahead  of  time  and  serve  them  as  static  files  from  an  S3 
bucket  configured  as  a  website. 


Maps  are  about  places  and  as  we've  discussed  in  earlier  blog  posts  SFO 

Museum  uses  and  extends  the  Who's  On  First 

gazetteer  https : //mills field . sf omuseum. org/blog/2018 /08 /28/whoson 
first/  (WOF)  to  describe  the  places  it  cares  about.  By  design,  the  go- 
raster  zen  code  doesn't  know  anything  about  the  Who's  On  First  gazetteer. 
All  it  knows  about  are  individual  map  tiles  and  geographic  extents. 


We've  built  a  second  tool  on  top  of  the  rasterzen-seed  tool  for  pre¬ 
rendering  tiles  for  an  index  of  Who’s  Of  First  documents.  It’s  called...  wof- 
rasterzen-seed  https : //github . com/whosonf irst/go-whosonf irst- 
rasterzen#wof-rasterzen-seed  . 


The  wof-rasterzen-seed  can  consume  any  index  supported  by  the  go- 
whosonfirst-index  https : //github . com/whosonf irst/go-whosonf irst- 
index  package  but  for  our  purposes  the  most  common  index  is  a  Git 
repository  https://github.com/sfomuseum-data  containing  Who's  On 
First  records.  Here's  how  we  would  seed  all  the  tiles  between  zoom  levels  5 
and  8  for  every  place  in  the  sfomuseum-data- 

architecture  https : //github. com/sfomuseum-data/sfomuseum-data- 
architecture  repository: 


.  /bin/wof-rasterzen-seed  -nextzen-apikey  {NEXTZEN_APIKEY}  \ 

-seed-worker  lambda  -lambda-function  {LAMBDA_FUNCTION}  \ 
-lambda-dsn  ' credentials={AWS_CREDENTIALS}  region={AWS_REGION} '  \ 


-min-zoom  5  -max-zoom  8  -timings  -seed-all  \ 

-mode  repo  /usr/local/data/sfomuseum-data-architecture 


The  only  difference  from  the  rasterzen-seed  tool  is  the  change  in  the 
semantics  of  the  -mode  command  line  flag. 


Under  the  hood  the  wof-rasterzen-seed  handles  all  the  work  of 
reading  Who's  On  First  documents  and  calculating  the  bounding  boxes  that 
are  used  to  determine  which  map  tiles  to  fetch.  For  example  wof- 
rasterzen-seed  will  generate  separate  extents  for  the  continental 
US  https://spelunker.whosonfirst.org/id/85633793/  and 
Hawaii  https://spelunker.whosonfirst.org/id/85688671/  and 
Puerto  Rico  https: //spelunker .whosonfirst.org/id/85633729/ 
rather  than  a  single  bounding  box  that  combines  all  three  places  since  that 
would  contain...  a  lot  of 

water  https://spelunker.whosonfirst.org/id/404528711/ 


We  hope  you  find  rasterzen-seed  (and  wof-rasterzen-seed)  as 
useful  as  we  do.  They  are  tools  that  allow  us  to  take  advantage  of  third-party 
services  without  going  "all  in"  and  running  the  risk  of  getting  trapped. 


The  map  tiles  produced  by  the  Nextzen  https://www.nextzen.org/ 
project  are  born  out  of  the  hard  experiences  of  similar  past  services  shutting 
down  or  being  commercialized  so  there  is  little  immediate  chance  of  them 
going  away.  Likewise  it's  almost  impossible,  in  2018,  to  imagine  Amazon 
disappearing  and  the  promises  and  benefits  of  so-called  "cloud"  and 
"serverless" 

Computing  http:  /  /blog . cleverelephant . ca/2  015/10/keynote-at- 
foss4g-20i5.html  are  both  real  and  tangible. 


Tomorrow  always  remains  a  mystery,  though.  Whether  it's  a  lapsed  domain 
name  renewal  taking  a  service  offline  or  a  change  in  pricing  or  terms  of  use 
making  a  service  unaffordable  or  a  service  simply  being 
SUnsetted  https : //blog . pinboard . in/201 1 /03 /anatomy_of _a_crushin 
g/  building  on  top  of  "platforms"  is  always 

risky  https://en.wikipedia.org/wiki/Vendor_lock-in  .  Risky  and 
often  prohibitively  expensive  depending  on  how  hard  it  is,  or  how  long  it 
takes,  to  mitigate  a  service  you  rely  on  no  longer  being  so  reliable. 


master  plan  and  report:  San  Francisco  International  Airport  (SFO),  Dreyfus s  &  Blackford,  Quinton  Engineers,  Ltd.,  Collection  of  SFO 

Museum,  2002.040.002 


Pre-caching  tiles  doesn't  necessarily  mean  we'll  have  a  copy  of  the  entire 
planet  if  Nextzen  closes  its  doors  but  it  does  mean  that  we  should  have  copies 
of  the  tiles  that  are  immediately  relevant  to  our  work.  Likewise  processing 
and  rendering  those  tiles  locally  might  take  a  lot  longer  than  it  does  doing  the 
same  work  in  Amazon's  "cloud"  but  that  is  very  different  than  not  being  able 
to  do  that  work  at  all  because  the  logic  for  doing  so  can't  be  untangled  from  a 
third  party's  infrastructure. 


There  are  sweet  spots  between  all  of  these  extremes  and  one  of  the  goals  with 
tools  like  gO-rasterzen  https://github.com/whosonfirst/go- 
rasterzen  is  to  help  find  them.  These  are  small  steps,  for  sure,  but  it  is 
forward  momentum  all  the  same. 


2018-11-07 


things  I  have  written  about 
elsewhere  #201 81 203 

Airlines  and  Companies  and  Enterprises  (and... 


Airlines  and  Companies  and 
Enterprises  (and 
Concordances) 


photo:  Alaska  Airlines,  Korean  Air,  Southwest  Airlines  planes  taxiing,  SFO,  2018 


This  was  originally  published  on  the  SFO  Museum  Mills  Field 

Weblog  https : / /mills field . sfomuseum.org/blog/2018/12/ 03/ 
airlines/  ,  in  December  20 1 8 . 

All  of  the  work  that  SFO  Museum  is  doing  around  "the  digital"  is 
being  done  with  an  ambitious  but  ultimately  pretty  simple  end- goal: 


To  ensure  that  every  aspect  of  a  trip  to  SFO,  and  every  facet  of 
someone's  time  spent  in  the  airport,  leads  back  to  the  museum's 
collection. 


SFO  Museum  is  actually  a  museum,  with  both  temporary  and 
permanent  collections,  and  a  library  and  an  archive.  These  are  all 
important  and  meaningful  distinctions  to  people  who  work  in  the 
cultural  heritage  sector.  The  distinctions  represent  unique  areas  of 
focus,  expertise  and  responsibility. 


At  the  same  time  people  not  working  in  the  cultural  heritage,  and 
busy  being  awesome  in  their  own  endeavours,  don't  split  these 
particular  hairs  the  same  way  we  do  so  when  I  talk  about  "the 
collection"  I  am  talking  about  everything  that  SFO  Museum  does, 
everything  that  we  produce,  everything  that  we  care  for. 


The  lovely  thing  about  a  museum  like  ours  is  that  the  visitors,  all 
the  places  they  travel  to  and  from  and  airplanes  and  airlines  that 
are  taking  them  are  the  glue  that  connects  the  museum's 
different  efforts. 


This  is  especially  true  of  the  museum's  permanent 
collection  https  :  / /www.  f  lysfo .  com/museum/ aviation-museum- 
library/collection  whose  focus  is  "the  history  of  commercial 
air  transport,  the  airline  industry,  and  San  Francisco  International 
Airport  with  a  regional  emphasis  on  the  West  Coast  and  the  Pacific 


Rim "  and  the  temporary 

exhibitions  https : / /millsf ield. sfomuseum.org/exhibitions 
/years/  which  are  a  delightful  mix  of  everything  under  the  sun 
and  are  composed  largely  of  objects  borrowed  from  public  and 
private  lenders. 


Place  and  travel  connect  them  since  every -thing  is  from  som e-place 
and  people,  the  passengers  and  visitors  passing  through  the  airport, 
are  what  connect  travel  and  place.  That's  why  we've  started  our 
efforts  at  the  macro  level,  with  maps  of  the  world  and  the 
infrastructure  to  sustain  them,  and  have  been  getting  closer  and 
closer  to  the  micro  level,  the  objects  in  our  collection,  with  work  to 
model  buildings  down  to  individual  galleries. 


coaster:  KLM  (Royal  Dutch  Airlines),  Collection  ofSFO  Museum,  1997.54.02 


To  that  end  we've  updated  the  Mills  Field  website  and  our  publicly 
available  datasets  to  include  two  more  "first-class"  actors  in  how  the 
museum  understands  its  work: 

Airlines  https://millsfield.sfomuseum.org/airlines/  and 
Companies  https://millsfield.sfomuseum.org/companies/ 


https  ://millsfield  .sfomuseum  .org/ airlines/ 


https  ://millsfield  .sfomuseum  .or  g/companies/ 


Actually,  it's  really  airlines  and  companies  and 

agencies  https://millsfield.sfomuseum.org/agencies/  to 

account  for  things  like 

NASA  https  :  /  /mills  field .  sfomuseum.org/agencies/115  92  8 
7  041/  or  a  national  postal 

Service  https : / /millsf ield . sfomuseum.org/ agencies /1 1592  8 
8179/  . 


https  ://millsfield  .sfomuseum  .or  g/ agencies/ 


The  generic  catch-all  term  for  all  three  is  that  each  is  an  " enterprise". 
As  in: 


From  WordNet  (r)  3.0  (2006)  [wn] : 
enterprise 

n  Is  a  purposeful  or  industrious  undertaking  (especially  one 

that  requires  effort  or  boldness);  "he  had  doubts  about  the 
whole  enterprise"  [syn:  {enterprise},  {endeavor}, 
{endeavour} ] 

2:  an  organization  created  for  business  ventures;  "a  growing 
enterprise  must  have  a  bold  leader" 

3:  readiness  to  embark  on  bold  new  ventures  [syn:  {enterprise}, 
{enterprisingness},  {initiative},  {go-ahead}] 


This  is  relevant  because  like  all  the  other  datasets  we've  published 
we've  modeled  "enterprises"  as  Who's  On  First 
documents  https  :  / /millsf  ield .  sfomuseum.org/blog/2  018/ 08 / 
2  8/whosonf  irst/  .  Which,  let's  be  clear,  is  not  really  what  Who's 


On  First  was  designed  for.  This  is  SFO  Museum  piggy-backing  on 
an  existing  project  and  then  starting  to  push  it  in  new  directions  to 
suit  our  needs. 

Who's  On  First  https://www.whosonfirst.org/what  was 
created  to  tackle  the  problem  of  place,  not  of  things  or  topics  or  of 
human  enterprise.  Still  most  activity  can,  at  some  level,  be  localized 
to  place  even  if  it's  something  as  mundane  or  uninspiring  as  where  a 
company  is  headquartered  or  where  it  file  income  taxes.  For 
example,  regardless  of  where  anything  that  Apple  sells  is  actually 
manufactured  (and  it's  an  open  question  whether  or  not  that  includes 
the  USA)  those  products  are  said  to  be  "designed  in 
California  https  :  / /millsf  ield .  sfomuseum.org/ search/ ? 
q=calif ornia&placetype=exhibition  " . 


photo:  Air  China  Boeing  747  taking  off,  SFO,  2018 


It  also  helps  that  airlines  fly  to  ...  well,  places. 


So  what  we've  said  is  that  an  airline,  or  a  company  or  an  agency  (an 
enterprise)  is  "parented"  by  the  place  that  it  is  "from"  which  means  a 
locality  or  at  least  a  country.  For  example  Continental 
Airlines  https : //mil Is field . sfomuseum.org/ airlines/ 1 1592 
84  2  03/  is  parented  by  the  city  of 

Houston  https : //mil Is field . sfomuseum.org/ id/ 10 172 5 62 9/ 

and  African  Safari 

Airways  https: / /millsfield. sfomuseum.org/companies/11592 
8  7511/  is  parented  by  the  country  of 

Kenya  https : / /millsfield . sfomuseum.org/countries /85632 
32  9/  .It  also  means  we  can  quickly  and  easily  show  you  all  the 

airlines  from 

Bangladesh  https : //millsfield . sfomuseum.org/countries/85 
632475/airlines/  or  any  of  the  other 

Countries  https : / /millsfield . sfomuseum.org/countries/ 
that  intersect  with  our  collection. 


SFO  Museum  distinguishes  between  airlines  and  companies  partly  to 
reflect  how  the  curators  and  registrars  have  thought  about  this  in  the 
past  and  also  to  reflect  the  reality  that  in  an  era  of  multinational 
companies  and  conglomerates  individual  airlines  are  just  facets  of  a 
larger  whole.  British 

Airways  https : / /millsfield . sfomuseum.org/airlines/11592 
84021/  ,Aer 


Lingus  https : / /millsfield. sfomuseum.org/airlines/115928 

3495/  and 

Iberia  https : / /millsfield . sfomuseum.org/ airlines/ 1 15  92  84 
511/  ,  among  others,  are  all  distinct  airlines  that  people  know  and 
recognize  that  also  happen  to  be  owned  by  something  called 

International  Airlines 

Group  https : / /en. wikipedia.org/wiki/International_Airlin 
es_Group  which  has  never  graced  an  airport  departures  board,  so 
far  as  I  know. 


airline  diagram:  Delta  Air  Lines,  Family  tree,  SFO  Museum,  2013 .140 .011 


We  see  the  same  thing  happening  in  the  museum  sector. 

MoMA  https  : / /en. wikipedia.org/wiki/Museum_of_Modern_A 
rt  is  no  longer  just  a  single  museum,  but  a 
museum  https://www.moma.org/  and  a  second 
museum  http :  /  /www .  momaps  l .  org/  in  Queens  and  a  number  of 
stores  https://store.moma.org/  in  New  York  City  and  Japan. 
The  same  is  true  of  the  Metropolitan  Museum  of 
Art  https://www.metmuseum.org/  with  its  three  museums  in 
New  York  or  the  Smithsonian  https :  /  /si . edu  with  its  many 
"units"  https://www.si.edu/museums  (that's  really  what  they 
call  them...)  on  The  Mall  in  Washington.  The  list  goes  on. 

Our  airlines  and  companies  data  does  not  always  not  reflect  all  the 
ways  the  airline  industry  has  grown  and  morphed  over  the  years  but 
our  goal  is,  in  time,  to  capture  that  history.  Like  most  museum 
collections  metadata  there  will  be  some  hiccups.  It's  important  to 
point  out  that  this  is  nothing  more  than  a  reflection  of  the  constraints 
and  demands  that  the  sector  has  operated  under  long  before  the 
Internet  and  all  its  fancy  databases  showed  up.  It  is  important  to 
remember  that  everything  we,  and  every  other  museum  out  there, 
have  accomplished  has  been  done  on  the  back  of  this  "imperfect" 
data. 


model  aircraft:  Republic  Airlines,  Boeing  727-200,  Collection  ofSFO  Museum,  2002.013.003  a  b 


Where  there  is  "weird  data"  we  will  fix  it.  The  ability  of  the  Who's 
On  First  model  to  inaccurate  or  bunk  data  as  well  as  the  evolution 
and  changes  of  a  place,  by  way  of  the  wof :  deprecated  and 
wof  :  super  seded_by  properties,  were  important  considerations 
when  choosing  to  use  it  to  describe  airlines  and  companies. 


So,  how  do  we  model  the  relationship  between  airlines  and 
companies? 

In  addition  to  adding  a  non-standard  Who's  On  First  (WOF) 
placetype  wof :  placetype=enterprise  we've  added  a  non- 


standard  WOF  property  called  wof  :  subordinate_of .  As  in: 


From  The  Collaborative  International  Dictionary  of  English  v.0.48  [gcide]: 

Dependent  \De*pend"ent\ ,  a.  [L.  dependens,  -entis,  p.  pr. 
dependere.  See  {Depend},  and  cf.  {Dependant}.] 

2.  Relying  on,  or  subject  to,  something  else  for  support;  not 
able  to  exist,  or  sustain  itself,  or  to  perform  anything, 
without  the  will,  power,  or  aid  of  something  else;  not 
self-sustaining;  subordinate; 

3.  conditional;  contingent  or  conditioned.  Opposite  of 
{unconditional} . 

Unfortunately,  many  definitions  of  the  term  subordinate  talk 
about  things  (that  are  subordinate)  being  "lower  class"  or  "inferior 
to"  or  even  "parasitical"  which  is  not  the  tenor  we're  aiming  for.  We 
use  the  term  to  denote  a  ranked  or  ordered  grouping,  where  one  or 
more  items  are  dependent  on  another.  To  recap: 

•  Airlines  are  parented  by  localities  or  countries  and 
subordinate  to  companies. 

•  Companies  are  parented  by  localities  or  countries  and 
may  be  subordinate  to  other  companies. 

There  is  no  limit  on  how  deep  a  subordinate-chain  can  be  or  its 
directionality  which,  it  seems,  is  a  pretty  accurate  reflection  of  how 
companies  and  corporations  actually  structure 

themselves  https : / /en. wikipedia.org/wiki/List_of_Nestl%C3 
%A9_brands  in  2018.  There  will  undoubtedly  be  some  edge  cases 


whose  nuances  we  don't  capture  with  this  model  but  it  feels  like  a 
good  place  to  start. 


playing  cards:  CAAC  (Civil  Aviation  Administration  of  China),  Collection  of  SFO  Museum  ,  2010.104.185 


Like  other  datasets,  all  of  the  enterprises  in  the  SFO  Museum 
collection  are  published  under  an  open  data  (CDLA)  license: 

https ://  github  .com/ sfomuseum-data/ sfomuseum-  data- enterprise 


And  we  have  updated  the  sf  omuseum-data-whosonf  irst 
dataset  to  include  all  the  new  places  (from  Who's  On  First)  that  these 
enterprises  are  from: 


https  ://github  .com/ sfomuseum-data/ sfomuseum-data- 

whosonfirst 


Sometimes,  the  question  is  asked:  Why  not  just  use  Wikipedia  (or 
Wikidata)  identifiers  for  things  like  airlines  or  companies  rather  than 
minting  even  more  pointers  for  what  everyone  agrees  is  the  same 
thing?  It's  a  reasonable  question  given  the  scope  and  the  breadth,  not 
to  mention  the  quality,  of  the  work  Wikipedians  have  done.  This  is 
especially  true  when  it  comes  to 

airlines  https://en.wikipedia.org/wiki/Airline  ,  it  turns  out. 


The  answer  is  as  important  as  it  is  simple:  SFO  Museum  needs  to  be 
its  own  "source  of  truth"  for  the  things  in  its  collection.  At  the  end  of 
the  day  we  need  to  be  able  to  exercise  editorial  interpretation,  bias 
and  control  according  the  museum's  own  guidelines  and  practices 
and  not  those  of  Wikipedia  (or  any  other  project).  We  also  need  a 


measure  of  stability  in  our  identifiers  that  other  sources  may  not  need 
or  provide.  For  example,  I  recently  discovered  that  the  same 

International  Air  Transport 

Association  https  :  //WWW.  iata.org/publications/Pages/code- 
search. aspx  (IATA)  code  (CZ)  has  been  used  for  two  completely 
different 

airlines  https : / /millsf ield . sfomuseum.org/ concordances/ iat 
a :  code=cz  ,  over  the  years. 


airline  bag:  CP  Air  (Canadian  Pacific  Airlines ),  Collection  ofSFO  Museum ,  2001.146.095 


But  we  would  like  to  hold  hands  with  Wikipedia  and,  where  it's 
appropriate,  we  would  be  happy  and  grateful  to  supplement  missing 
data  in  our  records  with  data  from  their  records,  notably  dates  and 
international  aviation  codes  and  callsigns. 


https://millsfield.sfomuseum.org/concordances/wikipedia/ 


https://millsfield.sfomuseum.org/concordances/wikidata/ 


Thanks,  Wikipedians!  You  can  see  the  list  of  all  the  known 
concordances,  with  Wikipedia  and  other  sources,  for  everything  in 
our  collection  here: 

https  ://millsfield  .sfomuseum  .or  g/concor  dances/ 


Concordances  are  great  because,  selfishly,  we  want  to  make  sure  that 
even  if  you  don't  use  our  identifiers  for  things  that  there  is  always  an 
avenue  back  to  our  collection  regardless  of  whose  identifiers  you 
use.  They  are  also  great  because  debate,  in  the  friendliest  sense  of 
the  word,  is  the  bed  rock  on  which  cultural  heritage  and  the 
humanities  rests  so  offering  people  a  multiplicity  of  views  and 
understandings  about  a  subject  seems  like  a  generally  good  thing. 


SEARCH  THE  COLLECTION  Q.  MENU  = 


YOURSTUFF-  TOYS-  EXPLORE  THE  COLLECTION- 


RANDOM 


AMERICAN  AIRLINES 


o 

o 


We  have  41  objects  that  American  Airlines  has  been  involved 
with. 


American  Airlines  has 
collaborated  with  Edward 
McKnight  Kauffer  and  Mr. 
Bernard  Waldman  and  New 
York  Subway  Advertising 
Company. 


Works  by  American  Airlines 
have  made  their  way  in  to  the 
museum  with  the  help  of  Mrs. 
Edward  McKnight  Kauffer  and 
Virginia  McBride  and  Kristina 
Parsons  and  Julia  Pastor. 


Short  URL:  http://cprhw.tt 
/p/2Avyz/ 

Person  ID:  18049749 

Tag  as:  ch:person=18049749 


We  especially  want  to  establish  concordances  between  our  collection 
and  other  museum  collections.  For  example,  we  know  that  our 

American 

Airlines  https : / /millsf ield . sfomuseum.org/companies/ 1360 
700619/  is  the  same  as  the  Cooper 

Hewitt  https://cooperhewitt.org/  Smithsonian  Design 


Museum's  American 

Airlines  https : / / collection . cooperhewitt . org /people/ 1804 
9749/  and  they  have  all  kinds  of  amazing 

objects  https : / /collection . cooperhewitt . org/people/180497 
49/objects/  related  to  the  airline  in  their  collection.  Why 
wouldn't  we  want  you  to  see  them? 


mills  field  |  collection  |  map  |  blog  |  random 

search  SFO  Museum 


Concordances  with  Cooper  Hewitt  Smithsonian 
Design  Museum  identifiers 

See  all  the  sources  that  SFO  Museum  has  concordances  with 


American  Airlines 

This  is  a  company  and  its  Cooper  Hewitt  Smithsonian  Design  Museum  identifier  is  18049749  /  we  reference  it  as 

chsdm:person=1 8049749 

Lufthansa  German  Airlines 

This  is  a  company  and  its  Cooper  Hewitt  Smithsonian  Design  Museum  identifier  is  18045975  /  we  reference  it  as 

chsdm:person=1 8045975. 

Scandinavian  Airlines  System  (SAS) 

This  is  a  company  and  its  Cooper  Hewitt  Smithsonian  Design  Museum  identifier  is  18536227  /  we  reference  it  as 

chsdm:person=1 8536227 


We  haven't  talked  much  about  people  in  the  context  of  the  SFO 
Museum  collection  yet  but  like  the  objects  they  are  involved  they  are 
radidly  coming  in  to  focus.  Alexander 


Girard  https : / /WWW. f lysf o . com/museum/ exhibitions /2 1649/ 
detail  (not  mention  Braniff 

Airways  https : / /millsf ield . sfomuseum.org/ search/ ? 
q=Branif f &placetype=exhibition  )  for  instance. 


Braniff  Airways  Model  66310  Armchair,  ca.  1968,  Alexander  Hayden  Girard,  Cooper  Hewitt  Smithsonian  Design 

Museum,  2013-37-1 


Business  Card,  Federal  Aviation  Agency,  1962,  Cooper  Hewitt,  Smithsonian  Design  Museum,  1993-31-105-2 


We  think  "people"  (and  airlines  and  companies)  will  be  an  especially 
rich  source  of  concordances  with  other  museum  collections  given 
how  many  individuals,  from  all  walks  of  life  and  across  the 
professional  spectrum,  are  involved  with  all  the  things  SFO  Museum 
is  interested  in. 


photo:  Virgin  Atlantic  https://millsfield.sfomuseum.org/airlines/1159288557/  and  British 
Airways  https :  //milisf  ieid.  sfomuseum.org/airiines/H5928402i/  planes,  International  Terminal 
A  https://millsfield.sfomuseum.org/boardingareas/1159396177/  , 

SFO  https://millsfield.sfomuseum.org/airports/SFO/  ,2018 

Next  up...  aircraft! 


2018-12-03 


